DocumentCode :
1448461
Title :
A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis
Author :
Lazar, Cosmin ; Taminau, Jonatan ; Meganck, Stijn ; Steenhoff, David ; Coletta, Alain ; Molter, Colin ; De Schaetzen, Virginie ; Duque, Robin ; Bersini, Hugues ; Nowé, Ann
Author_Institution :
Dept. of Comput. Sci., Vrije Univ. Brussel, Brussels, Belgium
Volume :
9
Issue :
4
fYear :
2012
Firstpage :
1106
Lastpage :
1119
Abstract :
A plenitude of feature selection (FS) methods is available in the literature, most of them arising from the need to analyze data of very high dimension, usually hundreds or thousands of variables. Such data sets are now available in various application areas such as combinatorial chemistry, text mining, multivariate imaging, and bioinformatics. As a generally accepted rule, these methods are grouped into filters, wrappers, and embedded methods. More recently, a new group of methods has been added to the general framework of FS: ensemble techniques. The focus of this survey is on filter feature selection methods for informative feature discovery in gene expression microarray (GEM) analysis, which is also known as differentially expressed genes (DEGs) discovery, gene prioritization, or biomarker discovery. We present them in a unified framework, using standardized notations in order to reveal their technical details and to highlight their common characteristics as well as their particularities.
Keywords :
arrays; bioinformatics; genetics; information filters; GEM analysis; bioinformatics; biomarker discovery; combinatorial chemistry; differentially expressed gene discovery; filter feature selection methods; gene expression microarray analysis; gene prioritization; multivariate imaging; standardized notations; text mining; Bioinformatics; Computational biology; Gene expression; Measurement; Search methods; Taxonomy; Feature selection; biomarker discovery; gene expression data; gene prioritization; gene ranking; information filters; scoring functions; statistical methods; Analysis of Variance; Bayes Theorem; Computational Biology; Gene Expression Profiling; Genetic Markers; Information Theory; Models, Statistical; Oligonucleotide Array Sequence Analysis; ROC Curve; Statistics, Nonparametric
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
IEEE
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2012.33
Filename :
6152088