Linux – Text processing commands
Linux/Unix offers many powerful text processing tools that can be combined to process data and extract information from files. Some of these are listed below with a few commonly used examples:
head -- display first lines of a file
tail -- display the last part of a file
cat -- concatenate and print files
less -- paged output, with support for moving forward and backward
cut -- cut out selected portions of each line of a file
sort -- sort lines of text files
uniq -- report or filter out ......
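
As a rough illustration of how the tools named above can be chained with pipes, here is a minimal sketch. The sample file and its "user,action" records are hypothetical and exist only for this example.

```shell
# Hypothetical sample data: one "user,action" record per line.
sample=$(mktemp)
printf '%s\n' \
  'alice,login' 'bob,login' 'alice,upload' \
  'carol,login' 'alice,logout' 'bob,logout' > "$sample"

# cut keeps the first comma-separated field, sort groups identical names,
# uniq -c counts each group, sort -rn orders by count (highest first),
# and head keeps only the top two lines.
top_users=$(cut -d, -f1 "$sample" | sort | uniq -c | sort -rn | head -n 2)
echo "$top_users"

# tail prints the last line of the file.
last_line=$(tail -n 1 "$sample")
echo "$last_line"

# Remove the temporary file.
rm -f "$sample"
```

uniq only collapses adjacent duplicate lines, which is why sort runs before it in the pipeline.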

Posted On Monday, February 18, 2013 1:07 PM

Can Vectorwise execute a query joining databases from multiple instances?
I was looking for a feature in Ingres Vectorwise similar to ‘Linked Server’ in MS SQL Server and ‘Database Link’ in Oracle. Unfortunately, it seems Ingres Vectorwise 2.5 does not offer anything similar. Ingres has ‘Register Table as Link’ (Ingres Star), which is currently not supported in Vectorwise. So I think the only option available today is to copy the tables into one database on one server and then do the joins there. Another way would be to create linked servers in MS SQL Server to different ......

Posted On Wednesday, February 13, 2013 9:15 AM

Ingres Vectorwise Client Tools
There are different tools available to connect to the Ingres Vectorwise database server. Actian (the parent company) offers the following tools:
1. DBA Tools (current version is 10.1.0)
2. Director Client (current version is 1.0.0)
These are available for download. With the server installation you also get the text-based tools sql and isql. Alternatively, you can use the open source project Squirrel SQL Client (http://squirrel-sql.source... ......

Posted On Tuesday, February 12, 2013 12:27 PM

Support for LINQ in Java
I have been programming in C# and .NET for a decade now. I loved the concept of LINQ and lambda expressions, which bring a layer of separation between the user's request to process a task and the actual execution of the task, which can be carried out differently on different systems. This is especially beneficial in today's world of multi-core machines. It gives the execution engine the power to parallelize queries according to the number of available cores. The Java community has realized ......

Posted On Monday, February 11, 2013 8:54 AM

Managing Remote Sessions on Linux
Recently I started working with Linux, and on the very first day I felt the need to let a remote session continue while I disconnected my laptop and left for home. Searching around for such a session management tool, I found ‘Screen’, which is quite powerful and helped me achieve my objectives. As defined by Adam Lazur, Screen is a terminal multiplexer that allows you to manage many processes through one physical terminal. Each process gets its own ......

Posted On Monday, February 11, 2013 7:47 AM

Internal.Job.Filename.Directory in Kettle
For the last few days I was wondering why this internal variable (Internal.Job.Filename.Dire... is always empty in my development environment. The documentation says these variables will always be defined, but that was not so in my case. After a lot of time spent debugging and breaking my head, I found that these variables are set only if you do not use the repository. In my case I was using a file-based repository, which was causing this issue. By disconnecting ......

Posted On Friday, February 8, 2013 9:44 AM

Pentaho Data Integration (Kettle) – Continuous processing by implementing a loop
Many articles on implementing a loop in PDI suggest putting a ‘Wait For’ step in the job and joining its output hop back to a previous step. I would like to warn that this way of looping causes recursive stack allocation by the JVM during job execution, and the system may run out of memory after a large number of iterations (depending on the available system memory). In a test run on my laptop in which I continuously ran the above job with ......

Posted On Friday, February 8, 2013 7:45 AM

My introduction to Pentaho Data Integration – Kettle (ETL Tool)
Context: Recently I got a chance to be introduced to Kettle, thanks to a new project that requires ingesting huge and dynamic data files into a data warehouse using PDI. I have good experience working with the Microsoft technology stack: .NET Framework, C#, SQL Server, SSIS. The idea of open source has always fascinated me, but I have not worked much with open source tools. So working with PDI is quite a different experience for me. I liked the continuous evolution of PDI and the capabilities ......

Posted On Wednesday, February 6, 2013 12:23 PM

