Course Brochure
DATA WAREHOUSING
Overview
A Data Warehouse is a storage area that holds an organization's complete data so that
efficient queries can be built on it for analysis and reporting. It is a subject-oriented
database in which data is associated with a single organizational process, often called
an entity. A Data Warehouse integrates data from various source databases, which has
significant strategic implications for Business Intelligence. Unlike an operational
database, a Data Warehouse does not cater to the real-time operational requirements of
the enterprise. It is a storehouse of current and historical data extracted from internal
or external data sources.
Prerequisites
Applications
COURSE CONTENTS
What is RDBMS
Features of RDBMS
What is Data warehousing
Architecture of Data warehousing
Difference between RDBMS & Data Warehouse databases
Definitions
ETL Process
Reporting
Types of Tables in D/W
Types of FACTS tables
Types of DIMENSION tables
Types of Schemas in D/W
What is Data Mart
Warehouse Approaches
INFORMATICA
Definitions
Informatica Tools
Informatica Architecture
Types of ports
Predefined functions
Types of Transformations:
Aggregator T/r
Connected Lookup T/r
Unconnected Lookup T/r
Connected Stored Procedure T/r
Unconnected Stored Procedure T/r
Normalizer T/r
SQL T/r
Transaction Control T/r
XML T/r
Workflow Tasks:
Start Task
Session Task
Event Wait Task
Event Raise Task
Email Task
Decision Task
Assignment Task
Control Task
Command Task
Timer Task
Session Properties
Worklet
Types of Caches
Types of Partitions
Sequential Batch Processing
Parallel Batch Processing
ETL Testing
Performance Tuning
Optimization
Debugging
Backup Recovery
Installation steps
COGNOS
Types of tools:
Framework Manager
Creation of project
Creation of data source
Relations among tables
Resolving infinite loops
Applying filters
Data scrubbing
Data Cleansing
Handling null values
Creation of Dimension objects
Creation of Measure objects
Creation of Package
Query Studio:
Introduction
Simple reports
Filters
Sort
Summarize
Format Data
Calculate
Define Custom Groups
Drill down
Drill Up
Goto
Charts
Define Conditional Styles
Apply Styles
Set web page size
Set page breaks
Group
Pivot
Sections
Swap rows and columns
Collapse group
Expand group
Report Studio:
Introduction
Simple reports
Query Calculations
Layout Calculations
Parameterized reports
Dynamic Headings in a report
Cascading reports
Reusability of a query
Master-Detail reports
2 Layouts with 2 Queries
2 Layouts with 1 Query
Grouped reports
Group filter reports
Page break reports
Applying sections
Removing sections
Calling one report from another report
Data format reports
Group Section reports
Creating headers
Merging data cells
Cross tab reports
Date Range prompt
Value prompt
Inline prompts
Repeater Table reports
Charts
Dashboard reports
Analysis Studio:
Introduction
Summary reports
Ranks
Event Studio:
Introduction
Scheduling reports
DATASTAGE
Datastage Administrator:
Server properties
Datastage project Administration
Editing projects and Adding projects
Deleting projects
Cleaning up project files
Upgrade licences
Auto purging
Permissions to users
Runtime Column Propagation
Enable Remote Execution of Parallel jobs
Add checkpoints for sequencer
Project protect
.APT Config file
Datastage Director:
Info description
Difference between Compile and Validate
Difference between Validate and Run
Datastage Designer:
Join stage, Lookup stage
Difference between Join and Lookup stages
Merge stage
Difference between Lookup and Merge stages
Remove duplicate stage
Sort stage, Pivot stage
Surrogate Key stage, Switch stage
Types of Lookups
Types of Transformer stages
Basic transformer stage
Transformer stage
Null handling in Transformer stage
If Then Else in Transformer
Stage variables
Constraints
Derivations
Peek stage, Head stage, Tail stage
Job properties
Local variables
Functions in Transformers
String, Date, Null handling functions
All properties in all stages
Slowly changing Dimensions (SCD)
SCD Type-1
SCD Type-2
SCD Type-3
Implementation of SCD Type-1 in Datastage
Implementation of SCD Type-2 in Datastage
JOB SEQUENCER:
CONTAINERS:
Reusability
Minimizing complexity
Local container
Shared container
Some jobs in container