Saturday, 9 April 2011

What is ETL and why Informatica ?

Before diving into Informatica lets do some formalities and talk about ETL and why to 
use Informatica. We are not going to talk about data warehousing and its fundamentals here as Mr. Google is always there for you. But if you still have any questions on data warehousing you can always ask here.
ETL is an abbreviation of the three words Extract, Transform and Load. It is an ETL process to extract data, mostly from different types of system, transform it into a structure that's more appropriate for reporting and analysis and finally load it into the database. 

ETL process


There are lot of ETL tools in market some paid some open-source. a few to name :
    Comercial ETL Tools:
  • IBM Infosphere DataStage
  • Informatica PowerCenter
  • Oracle Warehouse Builder (OWB)
  • Oracle Data Integrator (ODI)
  • SAS ETL Studio
  • Business Objects Data Integrator(BODI)
  • Microsoft SQL Server Integration Services(SSIS)
  • Ab Initio
    Freeware, open source ETL tools:
  • Pentaho Data Integration (Kettle)
  • Talend Integrator Suite
  • CloverETL
  • Jasper ETL
Now you will probably ask then why Informatica when so many of them are available. The answer is very simple because I feel comfortable with it and Informatica PowerCenter integrates data from virtually any business system, in any format, and delivers that data throughout the enterprise at any speed to improve operational efficiency. Highly available, high-performance, and highly scalable, the software serves as the foundation for all enterprise data integration projects.Informatica has a simple visual interface. It works mainly on Drag-Drop approach.