Title :
Multiclass microarray gene expression classification based on fusion of correlation features
Author :
Chetty, G. ; Chetty, M.
Author_Institution :
Fac. of Inf. Sci. & Eng., Univ. of Canberra, Canberra, ACT, Australia
Abstract :
In this paper, we propose novel algorithmic models based on fusion of independent and correlated gene features for multiclass microarray gene expression classification. It is possible for genes to get co-expressed via different pathways. Moreover, a gene may or may not be co-active for all samples. In this paper, we approach this problem with a optimal feature selection technique using analysis based on statistical techniques to model the complex interactions between genes. The two different types of correlation modelling techniques based on the cross modal factor analysis (CFA) and canonical correlation analysis (CCA) were examined. The subsequent fusion of CCA/CFA features with principal component analysis (PCA) features at feature-level, and at score-level result in significant enhancement in classification accuracy for different data sets corresponding to multiclass microarray gene expression data.
Keywords :
biology computing; genetics; principal component analysis; sensor fusion; CCA; CFA; canonical correlation analysis; correlated gene feature; correlation modelling technique; cross modal factor analysis; multiclass microarray gene expression classification; optimal feature selection technique; principal component analysis; statistical technique; Accuracy; Correlation; Feature extraction; Gene expression; Principal component analysis; Redundancy; Tumors; correlation features; gene expression analysis; molecular classification; multivariate statistics;
Conference_Titel :
Information Fusion (FUSION), 2010 13th Conference on
Conference_Location :
Edinburgh
Print_ISBN :
978-0-9824438-1-1
DOI :
10.1109/ICIF.2010.5711915