Title :
Cascading GA & CFS for Feature Subset selection in Medical Data Mining
Author :
Karegowda, Asha Gowda ; Jayaram, M.A.
Author_Institution :
Dept of Master of Comput. Applic., Siddaganga Inst. of Technol., Tumkur
Abstract :
Feature subset selection is of immense importance in the field of data mining. The increased dimensionality of data makes testing and training of general classification method difficult. This paper presents the development of a model for classifying Pima Indian diabetic database (PIDD). The model consists of two stages. In the first stage, genetic algorithm (GA) and Correlation based feature selection have been used in a cascaded fashion. GA rendered global search of attributes with fitness evaluation effected by CFS. The second stage a fine tuned classification is done using artificial neural networks by making the feature subset elicited in the first stage as inputs for the network. Experimental results signify that the feature subset identified by the proposed filter when given as input to Back propagation neural network classifier, lead to enhanced classification accuracy.
Keywords :
backpropagation; data mining; diseases; feature extraction; genetic algorithms; medical computing; neural nets; pattern classification; Pima Indian diabetic database classification; artificial neural network; attribute searching; backpropagation neural network classifier; correlation based feature selection; data dimensionality; feature subset elicitation; feature subset selection; fitness evaluation; genetic algorithm; medical data mining; Artificial neural networks; Clustering algorithms; Computer applications; Data mining; Electronic mail; Filters; Genetic algorithms; Medical diagnostic imaging; Supervised learning; Unsupervised learning; Correlation based feature selection; Feature selection; Genetic Algorithm; artificial neural network;
Conference_Titel :
Advance Computing Conference, 2009. IACC 2009. IEEE International
Conference_Location :
Patiala
Print_ISBN :
978-1-4244-2927-1
Electronic_ISBN :
978-1-4244-2928-8
DOI :
10.1109/IADCC.2009.4809226