Focus on high-risk problems using limited resources
Emphasis on data quality:
  Relevance
  Completeness
  Correctness
  Consistency
Each target had a similar test approach
The test backlog was a prioritized list of these tests
Detailed test scripts are expensive to produce
Our scripts outlined a guided exploration
Progress could be measured through a burndown chart
Regulatory requirements needed to be met
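As a rough illustration of a prioritized test backlog tracked with a burndown, here is a minimal Python sketch. The target names, the 1 to 5 risk and value scales, and the risk-times-value ordering are assumptions made for this example; the slides do not say how the backlog was actually scored.

```python
from dataclasses import dataclass


@dataclass
class BacklogItem:
    target: str  # hypothetical test target name
    risk: int    # assumed scale: 1 (low) to 5 (high)
    value: int   # assumed scale: 1 (low) to 5 (high)


def prioritize(items):
    """Order the backlog so the highest risk-times-value targets come first."""
    return sorted(items, key=lambda item: item.risk * item.value, reverse=True)


def burndown(total, completed_per_iteration):
    """Return the number of tests remaining at the start and after each iteration."""
    remaining = [total]
    for completed in completed_per_iteration:
        remaining.append(max(remaining[-1] - completed, 0))
    return remaining


backlog = prioritize([
    BacklogItem("dim_customer", risk=2, value=3),
    BacklogItem("fact_sales", risk=5, value=5),
    BacklogItem("etl_load_orders", risk=4, value=3),
])
```

Plotting the list returned by `burndown` against the iteration number gives the burndown chart; a line that flattens out signals that tests are not being completed as planned.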
Tests
We tested for completeness
  No missing records
  No missing fields
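The two completeness checks could be sketched as follows, using Python's built-in `sqlite3` module with an in-memory database. The table names (`src_orders`, `fact_orders`) and columns are hypothetical, and a real warehouse would run equivalent queries on its own database engine; this is only one possible way to express the checks.

```python
import sqlite3


def missing_records(conn, source, target, key):
    """Return keys present in the source table but absent from the target."""
    sql = (
        f"SELECT s.{key} FROM {source} s "
        f"LEFT JOIN {target} t ON s.{key} = t.{key} "
        f"WHERE t.{key} IS NULL ORDER BY s.{key}"
    )
    return [row[0] for row in conn.execute(sql)]


def missing_fields(conn, table, columns):
    """Return the number of NULL values found in each required column."""
    return {
        col: conn.execute(
            f"SELECT COUNT(*) FROM {table} WHERE {col} IS NULL"
        ).fetchone()[0]
        for col in columns
    }


# Hypothetical sample data: order 3 never reached the fact table,
# and order 2 arrived without an amount.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src_orders (order_id INTEGER, amount REAL);
    CREATE TABLE fact_orders (order_id INTEGER, amount REAL);
    INSERT INTO src_orders VALUES (1, 10.0), (2, 20.0), (3, 30.0);
    INSERT INTO fact_orders VALUES (1, 10.0), (2, NULL);
""")
```

Table and column names are interpolated directly into the SQL here, so they should come from a trusted list of test targets, never from user input.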
Test Points
Test every ETL, Fact, and Dimension
Test Results
Greater than 99.99995% data accuracy
Testing less than 20% of development effort
Common scripts, common understanding
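To put the accuracy figure in concrete terms, the short sketch below converts an accuracy percentage into the number of incorrect values it allows. The 10 million value total is an assumed figure chosen for illustration; the slides do not state the actual data volume. `Decimal` is used so the tiny error rate is not distorted by floating-point rounding.

```python
from decimal import Decimal


def max_errors(accuracy_percent: str, total_values: int) -> Decimal:
    """Return how many incorrect values a given accuracy percentage allows."""
    error_rate = (Decimal(100) - Decimal(accuracy_percent)) / Decimal(100)
    return error_rate * total_values


# Assumed volume of 10 million values, for illustration only.
allowed = max_errors("99.99995", 10_000_000)
```

At exactly 99.99995% accuracy, `allowed` works out to 5 incorrect values in 10 million, so accuracy greater than that figure means fewer than one error per 2 million values.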
Development standards issues: 23%
Implementation errors: 22%
ETL errors: 21%
Database issues: 13%
Design issues: 9%
Other issues: 12%
Bad data
Missing data
ETL errors
Counts off
Totals off
Failed calculations
Failed conversions
Unpopulated fields
Copyright Ideaca 2008
Conclusions
A value-based approach focused our test efforts to find more serious problems sooner
Applying agile principles allowed us to minimize wasted time and effort
Testing identified development process changes that had the greatest impact on data quality
New regulatory requirements mean that the ability to test is now a design issue
Focus on tool: database, data warehouse
Focus on process: tables, views, stored procedures
Test plans
Test cases
Detailed scripts for instructions

Focus on value: data usage in business context
Focus on outcome: stars/dimensions/facts
Test backlogs
Test targets
Light scripts as guides for exploration
Team communication is vital