Title :
A Hybrid Improved Ant Colony Optimization and Random Forests Feature Selection Method for Microarray Data
Author :
Xiong, Wen ; Wang, Cong
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Microarray gene expression data have been used in cancer discovery and prediction characterized by their small samples and large dimensionality. This paper proposes a hybrid method based on improved Ant Colony Optimization (ACO) and Random Forests (RF) for selecting a small set of marker genes from microarray data to produce high accuracy cancer classifier. The method preselects top-ranked features using a statistic t-test combined with feature importance score estimated by Random Forests. It uses the combined score as heuristic info and the classification accuracy of Random Forests as positive feedback for ant colony to refine the feature subset preselected. In order to accelerate convergence of ant colony, it distributes ants to different features and confines the size of solution to obtain quickly optimum and near-optimum. As a post processing, it employs restricted sequential forward selection (SFS) to construct optimum from near-optimum. Experiments show the method proposed provides higher recognition with smaller feature subset on two microarray gene expression data.
Keywords :
bioinformatics; cancer; data mining; optimisation; ant colony optimization; cancer; feature selection method; gene; microarray data; random forests; sequential forward selection; Ant colony optimization; Cancer; Clustering algorithms; Data analysis; Data mining; Drugs; Feedback; Gene expression; Machine learning algorithms; Radio frequency; Ant Colony Optimization (ACO); Random Forests (RF); cancer classification; feature selection; sequential forward selection (SFS);
Conference_Titel :
INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-5209-5
Electronic_ISBN :
978-0-7695-3769-6
DOI :
10.1109/NCM.2009.66