HYBRID MODEL FOR PERFORMANCE EVALUATION AND FEATURE SELECTION APPLIED ON ENDOMETRIAL CANCER DATA
Journal: IPASJ International Journal of Information Technology (IIJIT) (Vol.6, No. 9)Publication Date: 2018-10-08
Authors : A. Hency Juliet Dr.R. Padmajavalli;
Page : 011-016
Keywords : ;
Abstract
ABSTRACT Many factors affecting the success of data mining techniques, the pureness of data is one of the factors. The inclusion of irrelevant and noisy data in the pattern analyzing phase, can results poor predicting performance. To discover information from the endometrial carcinoma data set the process of cleaning, transforming and modeling are applied. Diverse kinds of pre processing techniques were functions in the data set in order to work with the full pledged data set. Methodology Used: The data mining tool Rapid Miner 5 and WEKA were used for this purpose. In this study, the endometrial cancer data was taken from the TCGA data portal with 547 patients' clinical data. Feature selection was done using various classifiers with the help of different evaluators and search method. Using R language the accuracy of the classifiers' model was checked for the minimized and full data sets. Findings: Hybrid model was adopted for the performance evaluation by combining naïve bayes and decision tree classifier and the accuracy of the new model is 92.71%. Keywords: Endometrial, R, carcinoma, classifiers, naïve bayes, decision tree.
Other Latest Articles
- Intersectoriality and community participation: paradigm of Public Health in Cuba
- MINING USER MOVEMENT BEHAVIOR PATTERNS IN A MOBILE SERVICE ENVIRONMENT
- EAACK - A SECURE INTRUSION DETECTION SYSTEM FOR MANETS
- ENHANCING TEACHING AND LEARNING FOR PLACEMENT TRAINING THROUGH LEARNING ANALYTICS USING R- TOOL
- Some considerations about teenage pregnancy
Last modified: 2018-10-09 00:23:51