A producer wants to
know.
Which
Whichare
areour
our
lowest/highest
lowest/highestmargin
margin
customers
customers??
What
Whatproduct
productprompromotions have
-otions
havethe
thebiggest
biggest
impact
impacton
onrevenue?
revenue?
Who
Whoare
aremy
mycustomers
customers
and
andwhat
whatproducts
products
are
arethey
theybuying?
buying?
What
Whatisisthe
themost
most
effective
effectivedistribution
distribution
channel?
channel?
What is a Data
Warehouse?
What is Data
Warehousing?
A process of
transforming data
into information
and making it
available to users in
a timely enough
manner to make a
difference
Information
Data
Data Warehouse
Subject Oriented
Integrated Data
Time-Variant Data
Nonvolatile Data
1. Subject Oriented
2. Integrated Data
3. Time-Variant Data
Allows for analysis of the past
Relates information to the present
Enables forecasts for the future.
4. Nonvolatile Data
Data Granularity
Data Marts
Top-Down Approach
Bottom-Up Approach
OLTP
Operational processing
Transaction
Clerk, DBA, database professional
OLAP
Informational processing
Analysis
Knowledge worker(e.g. managers)
Function
Long-term informational
requirements, decision support
DB Design
ER based, application-oriented
Star/Snowflake, subject-oriented
Data
Summarization
View
Unit of Work
Access
Focus
Operations
DB Size
Priority
Summarized, consolidated
Summarized
Complex query
Mostly read
Information out
Lots of scan
100 Gb to Tb
High flexibility, end-user autonomy
Metric
Number of Users
Transaction throughput
Thousands
Query throughput
Hundreds
Semistructured
Sources
www data
IT
Users
Archived
data
Extract
Transform
Load
(ETL)
MOLAP
Clients
(Tier 3)
Query/Reporting
Meta
Data
Data
Warehouse
Operational
Data Bases
Data sources
OLAP Servers
(Tier 2)
Analysis
Business
Users
Data Mining
ROLAP
Data Marts
Tools
Business Users
16
Data Warehouse
Architecture
Client
Client
Query & Analysis
Metadata
Warehouse
Integration
Source
Source
Source