Anda di halaman 1dari 32

Global Business Services

Informatica ETL Tool


- By Srini Veeravalli Siebel Analytics Factory

26th Sept, 2007

2007 IBM Corporation

Global Business Services

Informatica - ETL Tool

Synopsis:
Introduction to Informatica 7x Importance of Informatica for ETL Applications Informatica Architecture Informatica Client module Informatica Server Transformations in Informatica

Working with Workflow Manager


Working with Workflow Monitor Demonstration of Sample Informatica Mapping
2

2007 IBM Corporation

Global Business Services

Informatica Suite

Informatica PowerCenter

Informatica PowerMart

All product functionality, Includes all features except including the ability to register distributed metadata, multiple multiple servers. registered servers.
A PowerCenter lets you create a single repository that you can configure as a global repository.

2007 IBM Corporation

Global Business Services

Importance of Informatica for ETL Applications

Extraction, Transformation, Loading Can extract large volumes of data from multiple platforms, handle complex transformations on the data, and support high-speed loads. It can simplify and accelerate the process of moving data warehouses from development to test to production.

2007 IBM Corporation

Global Business Services

Informatica Architecture

Sources
PowerCenter and PowerMart access the following sources: Relational. File.

Application.
Mainframe. Other. Microsoft Excel and Access.

2007 IBM Corporation

Global Business Services

Informatica Architecture
Targets
PowerCenter and PowerMart can load data into the following targets: Relational.

File.
Application. Other. Microsoft Access. You can load data into targets using ODBC or native drivers, FTP, or external loaders.

2007 IBM Corporation

Global Business Services

Informatica Architecture
Informatica provides the following integrated components: Informatica repository. The Informatica repository is at the center of the Informatica suite. The Informatica Client and Server access the repository to save and retrieve metadata. Informatica Repository Server. The Informatica Repository Server manages connections to the repository from client applications. Informatica Client. Use the Informatica Client to manage users, define sources and targets, build mappings and mapplets with the transformation logic. Informatica Server. The Informatica Server extracts the source data, performs the data transformation, and loads the transformed data into the targets.

2007 IBM Corporation

Global Business Services

Informatica Architecture

Server

Source Analyzer

Sources

Target

Repository Manager
Designer

Warehouse Designer
TransformationDevelope r

Repository

Workflow Manager/Monitor Informatica Client

Mapping Designer Mapplet Designer

2007 IBM Corporation

Global Business Services

Connectivity

2007 IBM Corporation

Global Business Services

Informatica Client
Repository Manager. Use the Repository Manager to create and administer the metadata repository
Designer. Use the Designer to create mappings that contain transformation instructions for the Informatica Server. Source Analyzer. Import or create source definitions. Warehouse Designer. Import or create target definitions. Transformation Developer. Develop reusable transformations to use in mappings. Mapplet Designer. Create sets of transformations to use in mappings. Mapping Designer. Create mappings that the Informatica Server uses to extract, transform, and load data. Workflow Manager. Use the Workflow Manager to create, schedule, and run workflows. Workflow Monitor. Use the Workflow Monitor to monitor scheduled and running workflows for each Informatica Server.
10

2007 IBM Corporation

Global Business Services

Informatica Client Rep Server Admin Console

11

2007 IBM Corporation

Global Business Services

Informatica Client Rep Server Admin Console

Use the Administration Console to add repository configurations to the Console Tree. When you add a repository configuration, you can perform the following actions: Create a repository in a database. Change the Repository Server managing the repository. Upgrade an existing repository from an earlier version.

12

2007 IBM Corporation

Global Business Services

Informatica Client Repository Manager

13

2007 IBM Corporation

Global Business Services

Informatica Client Repository Manager


Repository Manager Tasks You can use the Repository Manager to perform the following tasks: Add a repository. You can add multiple repositories. Remove a repository. You can remove one or more repositories. Connect to a repository. You can connect to one repository or multiple repositories in a domain. Export and import repository connection information. You can export repository connection information from the client registry to a file. Truncate session and workflow log entries. You can truncate the list of session and workflow logs. Search for target definitions containing a keyword. You can use a keyword to search for a target definition. Search for repository objects. You can search for repository objects containing specified text or keywords.
14

2007 IBM Corporation

Global Business Services

Informatica Client Designer


The Designer has five tools to help you build mappings and mapplets so you can specify how to move and transform data between sources and targets. The Designer helps you create source definitions, target definitions, and transformations to build your mappings. The Designer allows you to work with multiple tools at one time and to work in multiple folders and repositories at the same time. Designer Tools The Designer provides the following tools: Source Analyzer. Use to import or create source definitions for flat file, XML, COBOL, Application, and relational sources. Warehouse Designer. Use to import or create target definitions. Transformation Developer. Use to create reusable transformations. Mapplet Designer. Use to create mapplets. Mapping Designer. Use to create mappings.
15

2007 IBM Corporation

Global Business Services

Informatica Client Designer Source Analyzer

16

2007 IBM Corporation

Global Business Services

Informatica Client Designer Source Analyzer


Importing Source / Target Definition from Data Base

17

2007 IBM Corporation

Global Business Services

Informatica Client Designer Warehouse Designer

18

2007 IBM Corporation

Global Business Services

Informatica Client Designer Mapping Designer

Output

19

2007 IBM Corporation

Global Business Services

Informatica Client Designer Windows


Designer Windows The Designer consists of the following windows:

Navigator. Use to connect to and work in multiple repositories and folders. You can also copy and delete objects and create shortcuts using the Navigator.
Workspace. Use to view or edit sources, targets, mapplets, transformations, and mappings. You can work with a single tool at a time in the workspace. Status bar. Displays the status of the operation you perform. Output. Provides details when you perform certain tasks, such as saving your work or validating a mapping. Overview. An optional window to simplify viewing workbooks containing large mappings or a large number of objects.
20

2007 IBM Corporation

Global Business Services

Transformations
Transformations are the manipulation of data from how it appears in the source system(s) into another form in the data warehouse.

This includes
Data merging: Process of standardizing data types and fields. Cleansing: This involves identifying any changing inconsistencies or inaccuracies. Eliminating inconsistencies in the data from multiple sources. Converting data from different systems into single consistent data set suitable for analysis. Aggregation: The process where by multiple detailed values are combined into a single summary value typically summation numbers representing dollars spend or units sold.

21

2007 IBM Corporation

Global Business Services

Transformations in Informatica
Aggregator Transformation: The Aggregator transformation allows you to perform aggregate calculations, such as averages and sums.

Expression Transformation: Expression transformation to calculate values in a single row before you write to the target. You can use the Expression transformation to perform any non-aggregate calculations.
Advanced External Procedure Transformations : Advanced External Procedure transformations operate in conjunction with procedures you create outside of the Designer interface to extend PowerCenter/PowerMart functionality. External Procedure Transformations : External Procedure transformations operate in conjunction with procedures you create outside of the Designer interface to extend PowerCenter/PowerMart functionality. Filter Transformation : Filter transformation provides allows you to filter rows in a mapping. Rank Transformation: Allows you to select only the top or bottom rank of data. Router Transformation: Is similar to Filter transformation but it used for two or more filter conditions.
22

2007 IBM Corporation

Global Business Services

Transformations in Informatica
Joiner transformation : Joiner transformation joins two related heterogeneous sources residing in different locations or file systems. The combination of sources can be varied. You can use the following sources:

Two relational tables existing in separate databases


Two flat files in potentially different file systems Two different ODBC sources Two instances of the same XML source

A relational table and a flat file source


A relational table and an XML source Lookup transformation: Lookup transformation in your mapping to look up data in a relational table, view, or synonym.

Normalizer Transformation: Normalization is the process of organizing data. use the Normalizer transformation with COBOL sources, which are often stored in a denormalized format
Sequence Generator Transformation: Is used for generates numeric values Stored Procedure Transformation: Is an important tool for populating and maintaining databases.
23

2007 IBM Corporation

Global Business Services

Transformations in Informatica
Sorter transformation Transformation : Sorter transformation allows you to sort data. You can sort data from a source transformation in ascending or descending order according to a specified sort key. Source Qualifier Transformation : Source Qualifier represents the rows that the Informatica Server reads when it executes a session. The Source Qualifier displays the transformation datatypes. The transformation datatypes in the Source Qualifier determine how the source database binds data when the Informatica Server reads it. XML Source Qualifier Transformation: When you add an XML source definition to a mapping, you need to connect it to an XML Source Qualifier transformation. Update Strategy Transformation: To update the target based on the flag values.
24

2007 IBM Corporation

Global Business Services

Work Flow Manager

25

2007 IBM Corporation

Global Business Services

Work Flow Manager


Workflow Manager Tools The Workflow Manager consists of three tools to help you develop a workflow: Task Developer. Use the Task Developer to create tasks you want to execute in the workflow. Workflow Designer. Use the Workflow Designer to create a workflow by connecting tasks with links. You can also create tasks in the Workflow Designer as you develop the workflow. Worklet Designer. Use the Worklet Designer to create a Worklet.

26

2007 IBM Corporation

Global Business Services

Work Flow Manager Server Editor

27

2007 IBM Corporation

Global Business Services

Informatica Server
The Informatica Server moves data from sources to targets based on workflow and mapping metadata stored in a repository. A session is a type of workflow task. A session is a set of instructions that describes how to move data from sources to targets using a mapping. When a workflow starts, the Informatica Server retrieves mapping, workflow, and session metadata from the repository to extract data from the source, transform it, and load it into the target. The Informatica Server uses the following processes to run a workflow:

The Load Manager process. Starts and locks the workflow, runs workflow tasks, and starts the DTM to run sessions.
The Data Transformation Manager (DTM) process. Performs session validations. Creates threads to initialize the session, read, write, and transform data.
28

2007 IBM Corporation

Global Business Services

Work flow Monitor


Navigator Window Time window Out put window

29

2007 IBM Corporation

Global Business Services

Work flow Monitor


Workflow Monitor is a tool that allows you to monitor workflows and tasks. You can view details about a workflow. You can run, stop, abort, and resume workflows from the Workflow Monitor.
The Workflow Monitor consists of the following windows: Navigator window. Displays monitored repositories, servers, and repositories objects. Output window. Displays messages from the Informatica Server and the Repository Server. Time window. Displays progress of workflow runs. Gantt Chart view. Displays details about workflow runs in chronological (Gantt Chart) format. Task view. Displays details about workflow runs in a report format, organized by task, folder, or status.
30

2007 IBM Corporation

Global Business Services

Questions ? ?

31

2007 IBM Corporation

Global Business Services

Thank You

32

2007 IBM Corporation

Anda mungkin juga menyukai