Title :
Robust biomarker discovery for cancer diagnosis based on meta-ensemble feature selection
Author :
Boucheham, Anouar ; Batouche, Mohamed
Author_Institution :
Comput. Sci. Dept., Coll. of NTIC, Constantine, Algeria
Abstract :
Identification of biomarkers from high-dimensional data is one of the most important emerging topics in genomics and personalized medicine. Gene selection aims to find a parsimonious subset of features that carries the most discriminative information for a specific disease. Variations in real clinical tests have a strong impact on diagnostic efficiency, which makes producing stable, robust signatures a crucial problem for feature selection algorithms, and this issue has recently received considerable attention. In this paper, we propose a novel Meta-Ensemble Feature Selection (MEFS) approach for biomarker discovery. It is based on the concept of the meta-ensemble, a promising new direction in machine learning. The objective is to produce a more parsimonious and robust selection with better classification accuracy. The proposed method differs from conventional ensemble learning techniques and uses Information Gain (IG) to evaluate the relevance of genes, since IG is simple, fast and meaningful within an appropriate ensemble method. The efficiency and effectiveness of our method were demonstrated through comparisons with the single and ensemble versions of IG and with other ensemble feature selection techniques. Results show that MEFS can substantially increase the robustness of biomarker discovery while improving classification accuracy.
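The abstract does not specify how MEFS builds or combines its ensembles, so the following is only a minimal sketch of the general idea it refers to: scoring discrete features by Information Gain and stabilizing the ranking over bootstrap resamples. The function names, the mean-rank aggregation, the number of resamples and the assumption that expression values are already discretized are all illustrative choices, not the authors' method.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(feature, labels):
    """IG(Y; X) = H(Y) - H(Y | X) for one discrete feature."""
    n = len(labels)
    groups = {}
    for x, y in zip(feature, labels):
        groups.setdefault(x, []).append(y)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_by_ig(X, y):
    """Return feature indices sorted from highest to lowest IG."""
    n_features = len(X[0])
    scores = [information_gain([row[j] for row in X], y)
              for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: -scores[j])


def ensemble_select(X, y, k, n_bootstraps=25, seed=0):
    """Pick k features by summed rank over bootstrap resamples.

    Mean-rank aggregation is an assumption made for this sketch; the
    abstract does not say how MEFS aggregates its rankings.
    """
    rng = random.Random(seed)
    n, n_features = len(X), len(X[0])
    rank_sum = [0] * n_features
    for _ in range(n_bootstraps):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        Xb = [X[i] for i in idx]
        yb = [y[i] for i in idx]
        for position, j in enumerate(rank_by_ig(Xb, yb)):
            rank_sum[j] += position
    return sorted(range(n_features), key=lambda j: rank_sum[j])[:k]
```

For example, with `X = [[0, 1, 0], [0, 1, 1], [1, 1, 0], [1, 1, 1]]` and `y = [0, 0, 1, 1]`, only the first feature separates the two classes, so `ensemble_select(X, y, k=1)` selects index 0. Real gene expression data would first need discretization, which this sketch leaves out.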
Keywords :
bioinformatics; cancer; feature selection; genetics; genomics; health care; learning (artificial intelligence); patient diagnosis; biomarker discovery; cancer diagnosis; gene selection; information gain; machine learning; metaensemble feature selection approach; personalized medicine; Accuracy; Biological system modeling; Computational modeling; Filtering algorithms; Gene expression; Robustness; gene expression profiling; health care systems; metaensemble feature selection; robust feature selection
Conference_Title :
Science and Information Conference (SAI), 2014
Conference_Location :
London
Print_ISBN :
978-0-9893-1933-1
DOI :
10.1109/SAI.2014.6918227