BY Poonam Bhargav
Outline
Why data mining?
What is data mining? KDD and Data Mining
Also known as Knowledge discovery(mining) in databases (KDD), knowledge extraction, data/pattern analysis, data archeology, data dredging, information harvesting, business intelligence, etc.
3
more than 30 movies in the last year with those whose rental account is lower
than 5 from a video Store.
Association
major(x, CS) takes(x, DB) grade(x, A) *1%, 75%]
Predicting a missing value, a user profile property that the user did not
submitted on web form.
7
Outlier analysis
technique to fraud detection, network intrusion detection
Retail industry Telecommunication Industry Biological Data Analysis Scientific Applications Sports Astronomy Health Industry Finance Law Agriculture
9
Summary
Data mining: discovering interesting patterns from large amounts of data A KDD process includes data cleaning, data integration, data selection, transformation, data mining, pattern evaluation, and knowledge presentation Data mining functionalities: characterization, discrimination, association, classification, clustering, outlier and trend analysis, etc.
10
References
Advances in Knowledge Discovery and Data Mining, U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy. AAAI/MIT Press, 1996. Data Mining: Concepts and Techniques, J. Han and M. Kamber. Morgan Kaufmann, 2000.
Data Warehousing, Data mining and OLAP, Alex Berson, Stephan J.Smith,1997 Knowledge Discovery and Data Mining in Databases, Vladan Devedzic, Principles of Knowledge Discovery in Databases, Osmar R. Zaiane , 1999
11
13