Title of article :
Correlation-Based Ensemble Feature Selection Using Bioinspired Algorithms and Classification Using Backpropagation Neural Network
Author/Authors :
Elgin Christo, V. R Anna University - Chennai - Tamil Nadu, India , Khanna Nehemiah, H Anna University - Chennai - Tamil Nadu, India , Minu, B Anna University - Chennai - Tamil Nadu, India , Kannan, A School of Computer Science and Engineering - Vellore Institute of Technology - Vellore - Tamil Nadu, India
Abstract :
A framework for clinical diagnosis which uses bioinspired algorithms for feature selection and gradient descendant backpropagation neural network for classification has been designed and implemented. The clinical data are subjected to data
preprocessing, feature selection, and classification. Hot deck imputation has been used for handling missing values and min-max
normalization is used for data transformation. Wrapper approach that employs bioinspired algorithms, namely, Differential
Evolution, Lion Optimization, and Glowworm Swarm Optimization with accuracy of AdaBoostSVM classifier as fitness function
has been used for feature selection. Each bioinspired algorithm selects a subset of features yielding three feature subsets.
Correlation-based ensemble feature selection is performed to select the optimal features from the three feature subsets. The
optimal features selected through correlation-based ensemble feature selection are used to train a gradient descendant backpropagation neural network. Ten-fold cross-validation technique has been used to train and test the performance of the classifier.
Hepatitis dataset and Wisconsin Diagnostic Breast Cancer (WDBC) dataset from University of California Irvine (UCI) Machine
Learning repository have been used to evaluate the classification accuracy. An accuracy of 98.47% is obtained for Wisconsin
Diagnostic Breast Cancer dataset, and 95.51% is obtained for Hepatitis dataset. 'e proposed framework can be tailored to develop
clinical decision-making systems for any health disorders to assist physicians in clinical diagnosis.
Keywords :
Algorithms , AdaBoostSVM , Backpropagation , Correlation-Based
Journal title :
Computational and Mathematical Methods in Medicine