Title :
Hybrid feature selection method for biomedical datasets
Author :
Solorio-Fernández, Saúl ; Trinidad, José Fco Martínez- ; Carrasco-Ochoa, Jesús Ariel ; Zhang, Yan-Qing
Author_Institution :
Comput. Sci. Dept., Nat. Inst. for Astrophys., Opt. & Electron., Tonantzintla, Mexico
Abstract :
Currently classifying high-dimensional data is a very challenging problem. High dimensional feature spaces affect both accuracy and efficiency of supervised learning methods. To address this issue, we present a fast and efficient feature selection algorithm to facilitate classifying high-dimensional datasets as those appearing in Bioinformatics problems. Our method employs a Laplacian score ranking to reduce the search space, combined with a simple wrapper strategy to find a good feature subset of uncorrelated features, giving as result a hybrid feature selection method which is useful for high dimensional spaces. Some experiments have been carried out on gene microarray datasets to demonstrate the effectiveness and robustness of the proposed method.
Keywords :
bioinformatics; biological techniques; data reduction; feature extraction; genetics; learning (artificial intelligence); molecular biophysics; pattern classification; Laplacian score ranking; bioinformatics problems; biomedical datasets; feature selection algorithm; feature subset; gene microarray datasets; high dimensional data classification; high dimensional feature spaces; hybrid feature selection method; search space reduction; supervised learning methods; uncorrelated features; wrapper strategy; Accuracy; Bioinformatics; Cancer; Classification algorithms; Filtering algorithms; Laplace equations; Machine learning; Feature selection; high-dimensional spaces; supervised classification;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2012 IEEE Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-1190-8
DOI :
10.1109/CIBCB.2012.6217224