Ronald Hochreiter
VO Datenbanksysteme, UK Datawarehousing
December 13, 2006
Market Basket Analysis
Market Basket Analysis - Data Description
• Southern Italy
Market Basket Analysis - Raw Data
Market Basket Analysis - Data Warehouse
Market Basket Analysis - SQL Data Analysis
The main problem is solved, but the data may still be flawed:
some purchases have not been split up correctly. The largest purchase
contains 131 items, followed by 66, 50, 37, 34, 33, ...
Quick (and dirty) solution: remove all purchases in which more than
30 items have been bought together?
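The quick fix above could be sketched as follows. This is a minimal sketch, not the lecture's own code: it uses Python's `sqlite3` with an in-memory database, assumes the table `transaction2` holds one row per purchased item with columns `purchase` and `product`, and uses made-up sample rows. The helper name `remove_large_purchases` is hypothetical.

```python
import sqlite3

def remove_large_purchases(conn, max_items=30):
    """Delete every purchase with more than max_items rows.

    Returns the list of (purchase, item count) pairs as they were
    before the deletion, largest purchase first.
    """
    sizes = conn.execute(
        "SELECT purchase, count(*) AS items FROM transaction2 "
        "GROUP BY purchase ORDER BY items DESC"
    ).fetchall()
    conn.execute(
        "DELETE FROM transaction2 WHERE purchase IN ("
        "  SELECT purchase FROM transaction2 "
        "  GROUP BY purchase HAVING count(*) > ?)",
        (max_items,),
    )
    conn.commit()
    return sizes

# Assumed sample data: one badly split purchase with 131 items, two normal ones.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE transaction2 (purchase INTEGER, product TEXT)")
rows = [(1, f"item{i}") for i in range(131)]
rows += [(2, "coke"), (2, "pizza"), (3, "coke")]
conn.executemany("INSERT INTO transaction2 VALUES (?, ?)", rows)

sizes_before = remove_large_purchases(conn)
remaining = conn.execute(
    "SELECT count(DISTINCT purchase) FROM transaction2"
).fetchone()[0]
print(sizes_before[0], remaining)
```

Listing the purchase sizes before deleting makes it easy to check whether the threshold of 30 is sensible for the data at hand.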
Market Basket Analysis - Buy-Together Analysis
Market Basket Analysis - Association Rules
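$\mathrm{support}\{A \to B\} = \frac{N_{A \to B}}{N}$
Here $N_{A \to B} = 1119$ is the number of purchases containing both A and B, and N is the total number of purchases, i.e. the number of rows returned by: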
SELECT count(*)
FROM transaction2
GROUP BY purchase
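As a hedged illustration, the support could be computed in a single step as sketched below. The sketch uses Python's `sqlite3` with an in-memory copy of `transaction2`; the sample rows, the product names other than coke, and the helper name `support` are assumptions. It counts purchases with `count(DISTINCT purchase)` so that N comes back as one number instead of one row per purchase, and it obtains $N_{A \to B}$ with a self-join on `purchase`.

```python
import sqlite3

def support(conn, a, b):
    """support{A -> B} = N_{A->B} / N for products a and b."""
    # Purchases that contain both products (self-join on the purchase id).
    n_ab = conn.execute(
        "SELECT count(DISTINCT t1.purchase) FROM transaction2 t1 "
        "JOIN transaction2 t2 ON t1.purchase = t2.purchase "
        "WHERE t1.product = ? AND t2.product = ?",
        (a, b),
    ).fetchone()[0]
    # Total number of purchases.
    n = conn.execute(
        "SELECT count(DISTINCT purchase) FROM transaction2"
    ).fetchone()[0]
    return n_ab / n if n else 0.0

# Assumed sample data: four purchases, two of which contain coke and pizza.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE transaction2 (purchase INTEGER, product TEXT)")
conn.executemany(
    "INSERT INTO transaction2 VALUES (?, ?)",
    [(1, "coke"), (1, "pizza"), (2, "coke"),
     (3, "pizza"), (3, "beer"), (4, "coke"), (4, "pizza")],
)

s = support(conn, "coke", "pizza")
print(s)
```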
Market Basket Analysis - Association Rules
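$\mathrm{confidence}\{A \to B\} = \frac{N_{A \to B}}{N_A}$
$N_A$ is the number of purchases containing A; for A = coke it is the number of rows returned by: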
SELECT count(product)
FROM transaction2
WHERE product = 'coke'
GROUP BY purchase
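The confidence could be computed along the same lines. Again this is only a sketch under assumptions: Python's `sqlite3`, an in-memory `transaction2` table with made-up rows, and a hypothetical helper `confidence`. $N_A$ is obtained as a single number with `count(DISTINCT purchase)` rather than by counting the rows of the grouped query.

```python
import sqlite3

def confidence(conn, a, b):
    """confidence{A -> B} = N_{A->B} / N_A for products a and b."""
    # Purchases that contain both products.
    n_ab = conn.execute(
        "SELECT count(DISTINCT t1.purchase) FROM transaction2 t1 "
        "JOIN transaction2 t2 ON t1.purchase = t2.purchase "
        "WHERE t1.product = ? AND t2.product = ?",
        (a, b),
    ).fetchone()[0]
    # Purchases that contain product a at all.
    n_a = conn.execute(
        "SELECT count(DISTINCT purchase) FROM transaction2 WHERE product = ?",
        (a,),
    ).fetchone()[0]
    return n_ab / n_a if n_a else 0.0

# Assumed sample data: coke appears in three purchases, two of them with pizza.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE transaction2 (purchase INTEGER, product TEXT)")
conn.executemany(
    "INSERT INTO transaction2 VALUES (?, ?)",
    [(1, "coke"), (1, "pizza"), (2, "coke"),
     (3, "pizza"), (3, "beer"), (4, "coke"), (4, "pizza")],
)

c = confidence(conn, "coke", "pizza")
print(c)
```

Unlike support, confidence is not symmetric: swapping the two products changes the denominator from the number of coke purchases to the number of pizza purchases.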