Posts

Penataho Data Integration (PDI) - ETL

Image
What is Pentaho Data Integration (PDI) or Kettle or Spoon Pentaho data integration (PDI) is a (ETL) tool which allows you to read a data from multiple source, extract or transform or process the data and push it into another form. Spoon is the tool which is used to design the transformation and execute the transformation. There are two basic types of activities can be done: 1. Transformations - .ktr 2. Jobs - .ktj Transformations is the data flows for ETL such as reading from a various files (ex: xml, JSON, csv etc), transforming data and loading it into a target location(ex:database). On the other hand Jobs uses transformation workflow in sequence by defining the which should execute first, second etc., or execute a transformation by various validation steps and execute it. Ex: "If source file is JSON execute these steps" Spoon is the tool which is used to develop the transformation. Pentaho CE (community edition) can be downloaded from below link and it is free to ...