Title :
Ensemble vote approach for predicting primary tumors using data mining
Author :
Naib, Mehak ; Chhabra, Amit
Author_Institution :
Dept. of Comput. Sci. & Eng., Guru Nanak Dev Univ., Amritsar, India
Abstract :
Primary tumor is a neoplasm which in clinical parlance is regarded as malignant, arising in one site and capable of giving rise to metastatic tumors. Primary tumor disease is a major health problem in today´s time. This paper aims at analyzing various data mining techniques for primary tumor prediction. The observations reveal that the hybrid approach of any three classifiers using Vote ensemble technique on resampled dataset has outperformed over all other single data mining classifiers.The study considers total 19 attributes by adding `small-intestine´ an attribute in the original primary tumor dataset. By addition of `small-intestine´ attribute, ensemble Vote classifier achieves high accuracy of 94.01% even when the data set contains missing values. Evaluations and results are carried out with 10-fold cross validation using Weka 3-6-10.
Keywords :
data mining; medical computing; pattern classification; tumours; 10-fold cross validation; Weka 3-6-10; clinical parlance; data mining; ensemble Vote classifier; ensemble vote approach; metastatic tumors; neoplasm; primary tumor disease; primary tumor prediction; small-intestine attribute; Accuracy; Cancer; Classification algorithms; Data mining; Diseases; Lungs; Tumors; ARFF; Data mining; Primary Tumor; Resampling; Vote; WEKA;
Conference_Titel :
Confluence The Next Generation Information Technology Summit (Confluence), 2014 5th International Conference -
Conference_Location :
Noida
Print_ISBN :
978-1-4799-4237-4
DOI :
10.1109/CONFLUENCE.2014.6949288