Anda di halaman 1dari 3

Research Plan

A Framework of association rule mining using machine learning. Nilamadhab Mishra Keywords:-Machine learning, pattern mining, pattern Classification

1. Introduction:The data mining process takes data from a data warehouse as input and identifies the hidden patterns through the machine learning tools. Association rule mining or pattern mining is one of the most popular data mining methods. "Pattern mining" is a data mining method that involves finding existing patterns in data. The Pattern mining identifies the hidden patterns through classifier and clustering technique. Several experiments are already done to learn and train the network architecture for the data set used in back propagation neural N/W with different activation functions. Further studies may be carried out in classification and clustering. Classification is done through supervised machine learning and clustering through unsupervised machine learning. So

Classification, clustering and pattern mining are the important research issues in data mining.

2. Motivations:The mining of association rules or patterns is one of the most popular problems in data mining. The actual motivations for searching association rules came from the need to analyze the supermarket transaction data, that is, to examine customer behavior in terms of the purchased products. Association rules describe how often items are purchased together. For example, an association rules item X => item Y (80%) states that four out of five customers that bought item X also bought item Y. Such rules can be useful for decisions concerning product pricing, promotions, store layout and many others. The association rule mining [1] or the pattern mining can also be used effectively in software defect predictions, associations and defect classifications. Micro array database[2] is a typical Relational database, which contains a large number of columns and a small number of rows, and it poses a great challenge for existing associated pattern mining algorithms that discover patterns in item enumeration

space complexity . So instead of searching the large number of columns in a micro array database (bioinformatics database), its associated framing patterns should be searched.

3. Artistic work:Here I have discussed the related work for association rules mining using machine learning. Thabtah provides a very detailed survey of current associative classification algorithms. The Association Rule mining can work well for limited number of attributes present inside the relations. Various methods are already proposed. CBA was one of the first associative classifiers and focuses on focusing on mining a special subset of association Rules, called class association rules (CARs). CMAR considers multiple rules at the time of classification using weighted 2. CPAR takes a totally different approach to associative classification in that it does not directly use the rules generated by ARM, but only uses the frequent item sets and their respective counts to build a classifier using a FOIL-like technique called PRM. Even before the advent of associative classifiers, many non-associative classifiers were proposed which was widely known decision tree classifier. Till date a single fuzzy-ARM [3] algorithm is used I.e. APRIORI algorithm. The fuzzy versions of Apriori do not perform fast against very large datasets and datasets with a large number of dimensions/ attributes. Verlinde et. al. describes in a fair amount of detail as how fuzzy Apriori can be used to generate fuzzy association rules. Hullermeier and Yi justify the relevance of fuzzy logic being applied to association rule mining in todays data-mining setup. Further also described in great detail the general model and application of fuzzy association rules.

4. Overall research Challenges


1. To achieve an improved pattern identification through an optimum machine learning model (NN model). 2. To develop a natural data/pattern mapping model for large and complex database. 3. To build a highly accurate and efficient classifiers for large and complex databases

5. Research plan
My plan is to address some problem and some probable solution approaches. Searching large volumes of data automatically, looking for certain patterns as well as creating concise representations of the data that can be used for future prediction or classification tasks. Conventional Associative Classification and Association Rule Mining (ARM) algorithms are inherently designed to work only with binary attributes, and expect any quantitative attributes to be converted to binary ones using ranges, like Age = [10, 60]. In order to mitigate this constraint, Fuzzy logic is used to convert quantitative attributes to fuzzy Binary attributes, like Age = Middle-aged, so as to eliminate any loss of information arising due to sharp partitioning, especially at partition boundaries, and then generate Fuzzy Association Rules using an appropriate Fuzzy ARM Algorithm. These Fuzzy Association Rules can then be used to train a Fuzzy Associative Classifier. Fuzzy Associative Classifiers so built can be used in a wide variety of domains and datasets, like transactional datasets and image datasets. To build highly accurate and efficient classifiers for large databases is one of the essential challenges of data mining and machine learning research. Better I can use some new problem solving approaches like GA-FUZZY based ARM; Neuro-fuzzy based ARM to achieve the desired research goal. Those Approaches can be used for designing Simple and Effective Associative Classifier and to build a set of optimal rules for a large and complex database.

6. References:[1] Nilamadhab Mishra. "Art of software Defect Association and Correction Using Association Rule mining ", International journal of Computer Engineering & Technology (ISSN Print: ISSN 0976 6367, ISSN Online: ISSN 0976 6375), Volume 1 Issue 1(June 2010), PP. 275 285. [2] Nilamadhab Mishra. "A Framework for Associated Pattern Mining over Micro array Database", Journal of Global Research in Computer Science (ISSN-2229-371X), Vol-2, N0-2, February-2011, PP.08 11. [3] A. Mangalampalli and V. Pudi. Fuzzy association rule mining algorithm for fast and efficient performance on very large datasets. In FUZZ-IEEE, pages 11631168, 2009.

Anda mungkin juga menyukai