over the last month The ten most popular programs over the past week The top 20% spending customer over the past quarter
the business process - fact tables. Determine the key dimensions that apply to each fact Check that a candidate fact is not actually a dimension with embedded facts Check that a candidate dimension is not actually a fact table within the context or DS requirement
customer analysis customer events (disconnection, payment) Eg Banking customer profiling customer events customer profitability account transactions
as fact table Ques analyze account transactions by account or how customer use service If focus is analysis of customer usage then dimension customer entity
address entity could be mistaken for a fact table More appropriate fact table no of operational events occurred at the specific address
Fact In a customer profiling or customer marketing database, it is probably a fact table Dimension In a retail sales analysis data warehouse, or any other variation customer is used as basis for analysis If a dimension can be viewed by more than three entities its a fact
business - minimum retention period for each detail Draw retention period graph showing detail necessary for each business function Eg Retail Sales analysis 3 months details Life style profiling - 6 months weekly
detailed fact data consider storing samples and aggregate the rest Eg retail sales analysis samples spot trends across all stores Not suitable to determine product buying patterns in all stores located at seaside
contains 3.65 billion rows 2 million customers, with 2.5 transactions per day per customer, 2 year retention period saving of 10bytes per row 10X3.65 billion bytes= 33.99GB
to the intelligent key the query can be satisfied by the fact table alone Diasadv If any of the identifier changes fact table will have to be updated
compared to reference to a time dimension table Storing an date offset from inherent start of the table (week, month, quarter) Storing a date range 7. Partition the fact table
table May not be appropriate in situation where additional data is not accessed very often
largest no of queries
products Use queries that compare facts within a grouping that exists at present with grouping that existed in the past It is necessary to store date ranges on the dimension table