Title :
Evaluation Models for the Effect of Sample Imbalance on Gene Selection
Author :
Yang, Kun ; Li, Jianzhong ; Wang, Chaokun ; Gao, Hong
Author_Institution :
Dept. of Comput. Sci. & Eng., Harbin Inst. of Technol.
Abstract :
In this paper, we considered the problem of sample imbalance in the context of gene selection. Based on simple random sampling, two evaluation models were proposed to investigate the effect of sample imbalance on gene selection. Under the proposed evaluation models, the performances of five famous gene selection methods on the unbalanced data were compared. The experimental results indicated that the proposed evaluation models are effective and the sample imbalance has a great influence on gene selection. Our findings provide some guidelines in the design of microarray experiments and the following data analysis, and two evaluation models are suitable for selecting feasible gene selection method to identify differential expression genes
Keywords :
biology computing; data analysis; genetics; data analysis; gene selection method; microarray experiment; simple random sampling; Biological system modeling; Cancer; Chaos; Computer science; Data analysis; Gene expression; Guidelines; Performance evaluation; Sampling methods; Statistical analysis;
Conference_Titel :
Computer and Computational Sciences, 2006. IMSCCS '06. First International Multi-Symposiums on
Conference_Location :
Hanzhou, Zhejiang
Print_ISBN :
0-7695-2581-4
DOI :
10.1109/IMSCCS.2006.59