Sunita Sarawagi
Monowar Hossain
York University
Agenda
Requirements on Indexing methods
Existing indexing methods
Optimization of R-Tree for OLAP data
R-Tree VS Bit-mapped Indices
Conclusion
Indexing
Pre-computation group-bys
Indexing summary data
Handing
Existing methods
Multidimensional
array-based methods
mapped indices
Pros:
Low cardinality data, bit maps are both spaced and retrieval
efficient.
Supports bitwise operations
Access data is clustered
All dimensions handles symmetrically
Cons
Range queries
Increased space overhead of storing the bit-maps specially for
high cardinality data
Expensive batch update as all bit mapped indices have to be
modified even for a single row insertion
Indices
Pros:
Cons:
spatial data
dense regions
Ask Expert?
Use of clustering algorithm (similar algorithm: image
analysis)
Need
evaluation!!
Bit-mapped
Pros:
Conclusion
High level overview
Recommended readings
MOLAP VS OLAP
R-Tree and variants
R-Tree alternatives
Computational of multidimensional aggregates
And More..