Anda di halaman 1dari 9

OLAP(Online Analytical Processing)

OLAP stands for Online Analytical Processing. It uses database tables (fact and dimension tables) to enable multidimensional viewing, analysis and querying of large amounts of data. OLAP technology could provide management with fast answers to complex queries on their operational data or enable them to analyze their company's historical data for trends and patterns. Online Analytical Processing (OLAP) applications and tools are those that are designed to ask complex queries of large multidimensional collections of data. Why we Need OLAP multidimensional nature of the business problem. very large number of records that can reach gigabytes and terabytes summarizing this data into a form of information that can by used by business analysts. limitations of SQL that, it cannot represent these complex problems. SQL is not able to use mathematical models in these SQL statements

OLAP Operations 1.Roll-up Performs aggregation on a data cube, either by climbing up a concept hierarchy for a dimension or by dimension reduction 2.Drill-down ..Can be realized by either stepping down a concept hierarchy for a dimension or introducing additional dimensions 3.Slice and Dice .Slice performs a selection on one dimension of the given cube, resulting in a sub cube Dice defines a subcube by performing a selection on two or more dimensions

4.Pivot (rotate) .Its a visualization operation that rotates the data axes in view in order to provide an alternative presentation of the data Categories of OLAP Tools MOLAP( Multidimensional OLAP) This is the more traditional way of OLAP analysis. In MOLAP, data is stored in a multidimensional cube. The storage is not in the relational database.That is, data stored in array-based structures.

Advantages: Excellent performance: MOLAP cubes are built for fast data retrieval, and are optimal for slicing and dicing operations. Can perform complex calculations: All calculations have been pre-generated when the cube is created.

Disadvantages: Limited in the amount of data it can handle Requires additional investment

Examples: Hyperion Essbase, Fusion (Information Builders)

ROLAP(Relational OLAP) This methodology relies on manipulating the data stored in the relational database to give the appearance of traditional OLAP's slicing and dicing functionality. In essence, each action of slicing and dicing is equivalent to adding a "WHERE" clause in the SQL statement. Data stored in relational tables

Advantages: Can handle large amounts of data Can leverage functionalities inherent in the relational database

Disadvantages: Performance can be slow Limited by SQL functionalities

Examples: Microstrategy Intelligence Server, MetaCube (Informix/IBM)

HOLAP (MQE: Managed Query Environment) HOLAP technologies attempt to combine the advantages of MOLAP and ROLAP. For summary-type information, HOLAP leverages cube technology for faster performance. It stores only the indexes and aggregations in the multidimensional form while the rest of the data is stored in the relational database.

Examples: PowerPlay (Cognos), Brio, Microsoft Analysis Services, Oracle Advanced Analytic Services

OLAP Guidelines /Rules for OLAP Dr. E.F. Codd the father of the relational model, created a list of rules to deal with the OLAP systems. Users should priorities these rules according to their needs to match their business requirements. 1. Multidimensional conceptual view: The OLAP should provide an appropriate multidimensional Business model that suits the Business problems and Requirements. 2. Transparency: The OLAP tool should provide transparency to the input data for the users. 3. Accessibility: The OLAP tool should only access the data required only to the analysis needed. 4. Consistent reporting performance: The Size of the database should not affect in any way the performance. 5. Client/server architecture: The OLAP tool should use the client server architecture to ensure better performance and flexibility. 6. Generic dimensionality: Data entered should be equivalent to the structure and operation requirements. 7. Dynamic sparse matrix handling: The OLAP too should be able to manage the sparse matrix and so maintain the level of performance. 8. Multi-user support: The OLAP should allow several users working concurrently to work together. 9. Unrestricted cross: dimensional operations: The OLAP tool should be able to perform operations across the dimensions of the cube. 10. Intuitive data manipulation

11. Flexible reporting: It is the ability of the tool to present the rows and column in a manner suitable to be analyzed. 12. Unlimited dimensions and aggregation levels: This depends on the kind of Business, where multiple dimensions and defining hierarchies can be made. In addition to these guidelines an OLAP system should also support: Comprehensive database management tools: This gives the database management to control distributed Businesses The ability to drill down to detail source record level: Which requires that The OLAP tool should allow smooth transitions in the multidimensional database. Incremental database refresh: The OLAP tool should provide partial refresh. Structured Query Language (SQL interface): the OLAP system should be able to integrate effectively in the surrounding enterprise environment.

Steps for Implementing OLAP Here are the major steps: Dimensional modeling Design and building of the MDDB Selection of the data to be moved into the OLAP system Data acquisition or extraction for the OLAP system Data loading into the OLAP server Computation of data aggregation and derived data Implementation of application on the desktop Provision of user training

Difference Between OLAP and OLTP

Different Types of Reports Standard versus Ad-hoc Condesed versus Detailed Replication in the Data Warehouse versus Direct Access

Standard versus Ad-ho Standard Many reports are executed not once, but many times. Therefore, they should not have to be recreated each time. Instead, it should be possible to re-access templates and fill them automatically with the new data

Ad-hoc However, if users would like a general view of certain facts for which there no pre-defined report exists, they need a tool that enables them to quickly compile this information themselves.

It is important that they can quickly find the data they need. They need a tool that provides a business-operations view of the relevant data fields - characteristics, indicators and other attributes - and their links. This tool must be clear, and must help to find the fields needed by filtering and sorting.

Condensed versus Detailed Condensed.In the case of reports with condensed data the main focus is on data that can be aggregated. Usually the top-down method is used, that is, users start at a relatively high level of condensing and then display successively more details for important data (zoom in or drilldown). Experts also change (navigate) between different views (slice and dice). Detailed..The other approach is the bottom-up approach. This is used mostly for reports that either contain hardly any data fields that can be aggregated, or for those in which the main focus from the start is on a detailed and complete overview of all selected data. To get an overview of the structure, particularly with large quantities of data, grouping and highlighting with color and font attributes are used. Replication in the Data Warehouse versus Direct Access Replication in the Data Warehouse A data warehouse has clear advantages in reporting: Data can be accessed more quickly because it has been stored in the way best suited to analysis. Not only are the system parameters for displaying data optimized, but suitable indexes, aggregates and joins are also stored in the database. Historical data can also be accessed and consistent analyses using several operative systems and even external data (for example via the Web) are possible. Direct Access In the case of large amounts of detailed data, the decision has to be made as to whether the data has to be replicated in its entirety in the data warehouse, or simply in condensed form. If users have to create up-to-date reports, they can still access the data in the operative system, since usually data is replicated in the data warehouse only at certain intervals.

Advantages of OLAP Enables analysts, executives, and managers to gain useful insights from the presentation of data. Can reorganize metrics along several dimensions and allow data to be viewed from different perspectives. Supports multidimensional analysis. Is able to drill down or roll up within each dimension.

Is capable of applying mathematical formulas and calculations to measures. Provides fast response, facilitating speed-of-thought analysis. Complements the use of other information delivery techniques such as data mining. Improves the comprehension of result sets through visual presentations using graphs and charts. Can be implemented on the Web. Designed for highly interactive analysis. Features of OLAP

Anda mungkin juga menyukai