Slot: S4&S5
Faculty: Florence S
Assignment – II
Level of learning
CO domain (Based on
Course Outcomes
Nos. revised Bloom’s
taxonomy)
Explain the concept of Data mining system and apply the
CO3 various preprocessing techniques on large dataset. K2
Attribute Information:
Diabetes files consist of four fields per record. Each field is separated by a tab and each record is
separated by a newline.
Attribute Information:
Diabetes files consist of four fields per record. Each field is separated by a tab and each record is
separated by a newline.
7. CO4 K3
This question aims to provide you a better understanding of the frequent pattern mining and the
closed/maximal pattern mining.
Implement a frequent pattern mining algorithm (e.g., the Apriori algorithm or FP-
Growth) to mine the frequent patterns from a transaction dataset.
Sample Input 0
2
BAC E D
AC
CBD
Sample Output 0
3 [C]
2 [A]
2 [A C]
2 [B]
2 [B C]
2 [B C D]
2 [B D]
2 [C D]
2 [D]
Sample Input 1
2
data mining
frequent pattern mining
mining frequent patterns from the transaction dataset
closed and maximal pattern mining
Sample Output 1
4 [mining]
2 [frequent]
2 [frequent mining]
2 [mining pattern]
2 [pattern]
Diabetes files consist of four fields per record. Each field is separated by a tab and each record is
separated by a newline.
Consider the heart disease data set which is available in the following link
https://archive.ics.uci.edu/ml/datasets/Heart+Disease.
It contains 14 attributes . attribute num indicates the class label.
Download “Rapid miner” tool using following link
https://rapidminer.com/get-started/
Use “RAPID MINER” data mining tool to complete the following task
1. Preprocess your dataset .
2. Generate association rules using apriori algorithm
3. Mine the frequent patterns using Fp growth.
4. Identify the class labels using decision tree induction
5. Identify the class labels with Multilayer feed forward neural network