DocumentCode :
1113
Title :
Multiclass Gene Selection Using Pareto-Fronts
Author :
Rajapakse, Jagath C. ; Mundra, Piyushkumar A.
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume :
10
Issue :
1
fYear :
2013
fDate :
Jan.-Feb. 2013
Firstpage :
87
Lastpage :
97
Abstract :
Filter methods are often used for selection of genes in multiclass sample classification by using microarray data. Such techniques usually tend to bias toward a few classes that are easily distinguishable from other classes due to imbalances of strong features and sample sizes of different classes. It could therefore lead to selection of redundant genes while missing the relevant genes, leading to poor classification of tissue samples. In this manuscript, we propose to decompose multiclass ranking statistics into class-specific statistics and then use Pareto-front analysis for selection of genes. This alleviates the bias induced by class intrinsic characteristics of dominating classes. The use of Pareto-front analysis is demonstrated on two filter criteria commonly used for gene selection: F-score and KW-score. A significant improvement in classification performance and reduction in redundancy among top-ranked genes were achieved in experiments with both synthetic and real-benchmark data sets.
Keywords :
Pareto analysis; biology computing; genetics; genomics; F-score; KW-score; Pareto-front analysis; filter methods; microarray data; multiclass gene selection; tissue samples; Benchmark testing; Bioinformatics; Cancer; Computational biology; Gene expression; Redundancy; Training; Aggregation statistics; Pareto-front analysis; filter methods; gene selection; multiobjective evolutionary optimization; Algorithms; Computational Biology; Databases, Genetic; Gene Expression Profiling; Humans; Models, Genetic; Models, Statistical; Neoplasms; Statistics, Nonparametric;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2013.1
Filename :
6407130
Link To Document :
بازگشت