Title :
Hybrid PCA and LDA Analysis of Microarray Gene Expression Data
Author :
Lu, Yijuan ; Tian, Qi ; Sanchez, Maribel ; Wang, Yufeng
Author_Institution :
Department of Computer Science, University of Texas at San Antonio, TX, USA
Abstract :
Microarray technology offers a high throughput means to study expression networks and gene regulatory networks in cells. The intrinsic nature of high dimensionality and small sample size in microarray data calls for the development of effective computational methods. In this paper, we propose a novel hybrid dimension reduction technique for classification - hybrid PCA (principal component analysis) and LDA (linear discriminant analysis) analysis. This technique effectively solves the singular scatter matrix problem caused by small training samples and increases the effective dimension of the projected subspace. It offers more flexibility and a richer set of alternatives to LDA and PCA in the parametric space. In addition, generalization of hybrid analysis of other dimension reduction techniques is also proposed in this paper, such as multiple discriminant analysis (MDA) and biased discriminant analysis (BDA). Extensive experiments on the yeast cell cycle regulation data set show the superior performance of the hybrid analysis over the traditional methods such as SVM.
Keywords :
Biology; Cells (biology); Computer science; Covariance matrix; Fungi; Gene expression; Linear discriminant analysis; Principal component analysis; Support vector machine classification; Support vector machines;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
Print_ISBN :
0-7803-9387-2
DOI :
10.1109/CIBCB.2005.1594942