Title :
Alternative Feature Mapping for Heterogeneous Gene Data Classification
Author :
Liang, Victor C. ; Ng, Vincent T Y
Author_Institution :
Dept. of Comput., Hong Kong Polytech. Univ. Kowloon, Kowloon, China
Abstract :
In order to overcome the limitation on small sizes of gene datasets, many meta-classification methods which ensemble classifiers with different datasets have been developed. However, due to discrepancies of the characteristics within heterogeneous or cross-platform datasets, the number of common and significant genes is usually small. Instead of matching common genes between heterogeneous datasets, we propose a novel solution, alternative feature mapping approach (AFM), to utilize related and discriminative gene expressions while not necessarily having exact matches. Genes in the training dataset are clustered and mapped to the test dataset as gene groups. Through analyzing the correlation within gene groups between training and test datasets, related significant genes can be applied for classification. We conducted experiments consisting of 8 heterogeneous datasets with different cancer types and platforms to test the effectiveness of AFM. Our experiments show that classification performance is greatly improved using suitable significant genes selected by AFM.
Keywords :
bioinformatics; cancer; genetics; pattern clustering; AFM; alternative feature mapping approach; cancer; clustered dataset; cross-platform dataset; discriminative gene expression; heterogeneous gene data classification performance; meta-classification method; test dataset; Bioinformatics; Biomedical engineering; Cancer; DNA; Diseases; Gene expression; Partitioning algorithms; Probes; Spatial databases; Testing; classification; feature mapping; heterogeneous; microarray;
Conference_Titel :
Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
Conference_Location :
Taichung
Print_ISBN :
978-0-7695-3656-9
DOI :
10.1109/BIBE.2009.21