Anda di halaman 1dari 17

Data Stage Overview

Objectives

 Describe a data warehouse or data mart


 Enterprise Data Integration
 Describe DataStage
 History of Datastage
 Identify the server and client components of
DataStage
 Describe DataStage projects
 Describe DataStage jobs
 Identify the steps for designing a DataStage job
What is a Data Warehouse?

 Repository of data
 Optimized for report generation
 Supports business analysis
 Projections
 Comparisons
 Assessments
 Extracted from operational sources
Integrated Summarized Filtered
Cleansed De-normalized Historical
Data Marts

 Like data warehouses but smaller in scope


 Organize data from a single subject area or
department
 Solve a small set of business requirements
 Cheaper and faster to build
Enterprise Data-Integration
DataStage

 With DataStage you can:


 Design jobs that extract, integrate, aggregate, transform
data
 Create, manage, and reuse metadata
 Run, monitor, and schedule jobs
 Manage your development environment
History Of Datastage

 Datastage was started in 1997 by company called


V-Mark.
 Later was taken over by Ardent , which in turn was
taken over by Informix.
 Current release is Datastage 6 (Viper) from
Ascential Software.
DataStage Application Components
DataStage Administrator

User
privileges

Connection
License timeout
info
DataStage Administrator

Permissions

Job
scheduling
User
privileges
DataStage Director

 Validate jobs
 Run jobs
 Monitor jobs
 Schedule jobs
 Gather statistics
DataStage Designer

 Specify extraction, transformation


 Denormalize (decode) data
 Aggregate data
 Split data
DataStage Manager

 Store metadata
 Reuse metadata
 Define routines
Development in DataStage

 Define project properties: Administrator


 Open project
 Design jobs: Designer
 Import metadata: Manager
 Define extractions, data flows, integrations
 Define transformations, constraints, aggregations
 Define loads
 Compile and debug jobs: Designer
 Run and monitor jobs: Director
DataStage Projects

 Created during installation


 Associated with a directory
 Must attach to
 Self-contained
 Multiple users can be working at the same time
DataStage Jobs
Compile
Debug
Passive
stage Active
stage

Lookup

Link

Anda mungkin juga menyukai