Posts

Showing posts from 2014

Things learned other than Pentaho

28 Jan 2014 CREATE DATABASE LINK (Oracle) Sample query select A.column1 from TABLEA A where not exists (select 1 from TABLEA, schemaX.TABLEZ Z where A.Column1=Z.Column2) JAVA SCRIPT Example var str = " Hyderabad "; var result=  str.substr(1,4)  So  result  is  yder Format str.substr(Index Number,# of characters from Index) -- CHECK FOR NUMERIC DATA replace( translate( rtrim(ltrim(FIELD_NAME)), '.0123456789', '00000000000' ), '0', '' ) is  null  condition will identify the data (numeric data only)  replace( translate( rtrim(ltrim(FIELD_NAME)), '.0123456789', '00000000000' ), '0', '' ) is not null  condition will identify the data (contains non-numeric data) that require cleansing.  

Pentaho database join step

Image
Data Warehousing Pentaho: (Year: 2014 ) Working Database join step This will not work you cannot replace table name and field name using parameters of database join step Instead you have to create 2 transformations. 1 for setting variables and 2 for using them.

Pentaho ETL

Image
Pentaho  Here, I am trying to write down my finding in Pentaho (Spoon). So that, I can quickly look back if I need to. This is more like a personal note and as of now, I am not sure if this could be of any help to others. 07 Feb 2014 ERROR HANDLING How to enable it? You can right click on the steps and you can see an option called Error Handling. Note: Not all the steps have this option. Error handling is disabled for Table input step, stream look-up step etc.. How to define the field names? You can give the names you are interested in for the predefined fields of error handling. So, these fields will be added to the fields flowing in your current stream Currently, for the Pentaho 5.0, we are having a bug for ERROR_COUNT field, which is getting type as Long instead of Integer for some of the rows. So, as a temporary fix, we have used javascript to handle this public boolean processRow(StepMetaInterface smi, StepDataIn...