Anda di halaman 1dari 22

Data Warehouse

Presentation Prepared by:


Vikram

A producer wants to
know.
Which
Whichare
areour
our
lowest/highest
lowest/highestmargin
margin
customers
customers??

What
Whatproduct
productprompromotions have
-otions
havethe
thebiggest
biggest
impact
impacton
onrevenue?
revenue?

Who
Whoare
aremy
mycustomers
customers
and
andwhat
whatproducts
products
are
arethey
theybuying?
buying?

What
Whatisisthe
themost
most
effective
effectivedistribution
distribution
channel?
channel?

What is a Data
Warehouse?

A single, complete and consistent


store of data obtained from a
variety of different sources made
available to end users in a what
they can understand and use in a
business context.

What is Data
Warehousing?

A process of
transforming data
into information
and making it
available to users in
a timely enough
manner to make a
difference

Information

Data

Data Warehouse
Subject Oriented
Integrated Data
Time-Variant Data
Nonvolatile Data

1. Subject Oriented

2. Integrated Data

3. Time-Variant Data
Allows for analysis of the past
Relates information to the present
Enables forecasts for the future.

4. Nonvolatile Data

Data Granularity

Data Marts

Top-Down Approach

Bottom-Up Approach

DBMS Vs. Data Warehouse


Feature
Characteristics
Orientation
User

OLTP
Operational processing
Transaction
Clerk, DBA, database professional

OLAP
Informational processing
Analysis
Knowledge worker(e.g. managers)

Function

Day-to Day operations

Long-term informational
requirements, decision support

DB Design

ER based, application-oriented

Star/Snowflake, subject-oriented

Data

Current; guaranteed up-to-date

Historical; accuracy maintained over


time

Summarization
View
Unit of Work
Access
Focus
Operations
DB Size
Priority

Primitive, highly detailed


Detailed
Short, simple transaction
Read/write
Data in
Index/hash on primary key
100 Mb to Gb
High performance, High availability

Summarized, consolidated
Summarized
Complex query
Mostly read
Information out
Lots of scan
100 Gb to Tb
High flexibility, end-user autonomy

Metric
Number of Users

Transaction throughput
Thousands

Query throughput
Hundreds

Data Warehousing Components


Data
(Tier 0)

Data Warehouse Server


(Tier 1)

Semistructured
Sources

www data

IT
Users

Archived
data

Extract
Transform
Load
(ETL)

MOLAP

Clients
(Tier 3)
Query/Reporting

Meta
Data

Data
Warehouse

Operational
Data Bases

Data sources

OLAP Servers
(Tier 2)

Analysis

Business
Users
Data Mining

ROLAP

Data Marts

Tools

Business Users

16

Defining the Business


Requirements for Data Warehouse

Consider, we want to create operational system


for order processing department of a company.
Users can easily define the requirements as:

How they receive the orders


Check stock
Verify customers credit arrangements
Price the order
Determine shipping arrangements
Route the order to the appropriate warehouse
GUI they use for processing
How and when they use the application

Dimensional nature of Business


data

Even though the users cannot fully describe what


they want in a data warehouse, they can provide
you with very important insights into how they
think about the business.
They can tell you what measurement units are
important for them.
Each user department can let you know how they
measure success in that particular department.
The users can give you insights into how they
combine the various pieces of information for
strategic decision making.

From Requirement to Data


Design

Data Warehouse
Architecture
Client

Client
Query & Analysis

Metadata

Warehouse

Integration

Source

Source

Source

3-Tier Architecture of Data


Warehouse

Anda mungkin juga menyukai