No
TITLE
APPLYING MODEL FOR PREDICTION IMPLEMENTATION OF BAYESIAN MODEL ON IMPORTED DATA CROSS VALIDATION CREATION OF GENERIC OPTIMIZATION PREPROCESSOR REPRESENTING DATA USING DECISION TREE
10
Lab Manual Prepared by: Prof (Dr). V. SARAVANAN, School of Computer Applications, LPU
1
IMPORTING DATA INTO RAPIDMINER AIM: To import files into RapidMiner tool using three methods. ALGORITHM: Step 1: Open the RapidMiner tool. Step 2: In the welcome perspective that opens, select the new icon. Method 1: Step 3: Select the repository location and click ok. Step 4: In the design perspective, click operators view import data Read Excel. Step 5: The Read Excel operator will be displayed in the process view. Step 6: Connect the out of the operator to the res of the process. Step 7: Click on the operator to view the parameters and browse the file to be imported. Step 8: Run the process. The result can be viewed in metadata, data and plot views. Method 2: Step 9: In the design perspective, click on the repositories view. Step 10: Click on the Import drop down, select import Excel sheet. Step 11: Select the file to be imported and` click Next. Step 12: Specify a repository location, click Finish and view the result. Method 3: Step 13: Drag the file to be imported and drop it in the NewLocalRepository of the repositories View and click Next. Step 14: Specify a repository location, click Finish and view the result. SCREENSHOTS: Method 1:
Method 2:
Method 3:
10
11
RESULT: The files are imported into the RapidMiner tool using the three methods.
12
STORING AND RETRIEVING DATA AIM: To store and retrieve data into RapidMiner tool. ALGORITHM: Step 1: Open the RapidMiner tool. Step 2: In the welcome perspective that opens, select the new icon. Step 3: Select the repository location and click ok. Step 4: In the design perspective, click operators view import data Read Excel. Step 5: The Read Excel operator will be displayed in the process view. Step 6: In the design perspective, click operators view Repository AccessStore. Step 7: The Store operator will be displayed in the process view. Step 8: Connect the out of the operator to the inp of the Store and thr of store to the res of process. Step 9: Click on the operator to view the parameters and browse the file to be imported. Step 10: Run the process. The result can be viewed in metadata, data and plot views.
13
14
15
Retrieving data:
16
RESULT: The data are stored and retrieved into the RapidMiner tool.
17
18
SCREEN SHOTS:
19
20
APPLYING MODEL FOR PREDICTION AIM: To predict the unknown data using neural net and apply models in RapidMiner tool. ALGORITHM: Step 1: Open the RapidMiner tool. Step 2: In the welcome perspective that opens, select the new icon. Step 3: Select the repository location and click ok. Step 4: Import two csv files containing the mark and result details. Step 5: The result field of first file is given as label and of the second as prediction. Step 6: Retrieve both the files into the design view of the process. Step 7: In the design perspective, click operators view Modeling Classification and RegressionNeural Net trainingNeural Net. Step 8: In the design perspective, click operators view Modeling Model ApplicationApply Model. Step 9: Connect the out of the retrieve to the tra of the neural net. The mod of neural net is connected to the mod of apply model. Step 10: Connect the out of the retrieve to the unl of the apply model. The mod and lab of apply model is connected to the res of the process. Step 11: Run the process to view the predicted data and improved neural net.
SCREENSHOTS:
21
22
23
24
RESULT: The unknown data are predicted using neural net and apply model of RapidMiner tool.
25
IMPLEMENTATION OF NAVE BAYESIAN MODEL ON IMPORTED DATA AIM: To implement the nave baysian on imported data. ALGORITHM: STEP 1: Open the rapid miner tool. STEP 2 : In the process window place the retrieve operator and select the dataset you want to import. STEP 3: Place the nave bayes operator and connect it with the retrieve operator. STEP 4:Place the Apply model operator and connect it with the nave bayes . STEP 5: Run the process
26
SCREEN SHOTS:
27
RESULT: Thus the naive baysian has been implemented on the imported data successfully.
28
CROSS VALIDATION AIM: To perform cross validation in RapidMiner tool using validation operator. ALGORITHM: Step 1: Open the RapidMiner tool. Step 2: In the welcome perspective that opens, select the new icon. Step 3: Select the repository location and click ok. Step 4: Import an excel file and retrieve it. Step 5: In the design perspective, click operators view Evaluation Validation XValidation. Step 6: Click on the validation process operator which will lead to the training and testing process. Step 7: Click operators viewModelingClassification and RegressionBayesian ModelingNave Bayes and place it in training process. Step 8: Click Modeling Model Application Apply Model and EvaluationPerformance, MeasurementPerformance and place them in testing process and make connections. Step 9: Connect the ave of the validation operator to the res of the process. Step 10: Run the process. The result can be viewed in performance vector view.
SCREENSHOTS:
29
30
31
RESULT: The cross validation is performed using validation operator of RapidMiner tool.
32
CREATION OF GENERIC OPTIMIZATION PREPROCESSOR AIM: To create a generic optimization preprocessor in RapidMiner tool. ALGORITHM: Step 1: Open the RapidMiner tool. Step 2: Select a location and name for the local repository. Step 3: Select Operators view Utility Data Generation Generate Data Step 4: Select Operators view Data Transformation Attribute set Reduction and Transformation Selection OptimizationOptimize selection Step 5: Connect the operators and click on the process which then leads to the validation process. Step 6: Select operators view EvaluationValidationSplit validation and make the connections. Step 7: Click on the process which proceeds to the training and testing process. Step 8: Select operators ModelingClassification and Regression Support Vector Modeling Support Vector Machine in the training phase. Step 9: Select operatorsModeling Model Application Apply Model and Evaluation Performance and Measurement Performance in the testing phase. Step 10: Make the appropriate connections. Step 11: Run the process in forward selection and backward elimination mode.
SCREENSHOTS:
33
34
35
Forward Selection:
36
Backward Elimination:
RESULT: The forward and backward optimization is created using optimize selection operator of the RapidMiner tool.
37
REPRESENTING DATA USING DECISION TREE AIM: To represent data using decision tree in the RapidMiner tool. ALGORITHM: Step 1: Open the RapidMiner tool. Step 2: Select a location and name for the local repository. Step 3: Select Operators view Utility Data Generation Generate Direct Mailing Data Step 4: Select operators view EvaluationValidationSplit validation and make the connections. Step 5: Open the process which proceeds to the training and testing process. Step 6: In the training phase, select operators ModelingClassification and Regression Tree Induction Decision tree Step 7: Place the apply model and performance operators in the testing phase. Step 8: Make the appropriate connections. Step 9: Run the process to view the performance and decision tree. SCREENSHOTS:
38
39
RESULT: Thus the data is represented using decision tree in the RapidMiner tool.
40
EVOLUTIONARY WEIGHTING OF THE ATTRIBUTES AIM: To perform the evolutionary weighting of the attributes using RapidMiner. ALGORITHM: Step 1: Open the RapidMiner tool. Step 2: Select Operators Modeling Attribute WeightingOptimize Weights (evolutionary) and place it in the main process. Step 3: Import and retrieve a dataset whose character field is label. Step 4: On double clicking, Optimize weights it leads to Evaluation Process. Place Split Validation operator in it. Step 5: Again on double clicking the process, leads to Training and Testing phase. Step 6: In Training phase, place Neural net. Step 7: In testing phase, place Apply Model and Performance. Step 8: In the main process, click on the optimize weights operator. The right side of the window have several options to be modified. Step 9: Give the population size as 10, check early stopping and show population plotter, set the selection scheme as roulette wheel, p crossover as 0.2 and crossover type as shuffle. Step 10: Make the necessary connections and run the process.
RESULT: Thus the evolutionary weighting of the attributes are obtained using RapidMiner. 41
42
SCREEN SHOTS:
43
44
45
RESULT: Thus the text document has been mined by using the rapid miner successfully.
46