Title :
Toward optimal selection of feature clusters
Author :
Yu, Lei ; Li, Hao
Author_Institution :
Binghamton Univ., Binghamton
Abstract :
In microarray data analysis, the large number of equally predictive gene sets and the disparity among them reveals the gap between necessary genes for accurate models and candidate genes for biomarkers. We propose to bridge this gap by a new learning task, feature cluster selection, which aims to select all relevant features in a data set and group them into coherent clusters. We provide problem definitions and an empirical solution to feature cluster selection. Experiments on microarray data show that our proposed solution can select highly predictive representative gene sets and discover gene clusters with statistical significance.
Keywords :
biology computing; data analysis; genetics; learning (artificial intelligence); pattern classification; biomarkers; feature cluster selection; learning task; microarray classification; microarray data analysis; predictive gene sets; Accuracy; Application software; Biological system modeling; Biomarkers; Bridges; Computer science; Data analysis; Machine learning; Predictive models; Training data;
Conference_Titel :
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-0-7695-3069-7
DOI :
10.1109/ICMLA.2007.93