DocumentCode :
3408535
Title :
A mixed factors model for dimension reduction and extraction of a group structure in gene expression data
Author :
Yoshida, Ryo ; Higuchi, Tomoyuki ; Imoto, Seiya
Author_Institution :
Graduate Univ. for Adv. Studies, Tokyo, Japan
fYear :
2004
fDate :
16-19 Aug. 2004
Firstpage :
161
Lastpage :
172
Abstract :
When we cluster tissue samples on the basis of genes, the number of observations to be grouped is much smaller than the dimension of feature vector. In such a case, the applicability of conventional model-based clustering is limited since the high dimensionality of feature vector leads to overfilling during the density estimation process. To overcome such difficulty, we attempt a methodological extension of the factor analysis. Our approach enables us not only to prevent from the occurrence of overfilling, but also to handle the issues of clustering, data compression and extracting a set of genes to be relevant to explain the group structure. The potential usefulness are demonstrated with the application to the leukemia dataset.
Keywords :
biological tissues; biology computing; blood; cancer; data compression; genetics; physiological models; statistical analysis; cluster analysis; data compression; density estimation process; dimension reduction; factor analysis; feature vector; gene expression data; group structure extraction; leukemia; mixed factors model; tissue samples; Bioinformatics; Data compression; Data mining; Diseases; Eigenvalues and eigenfunctions; Gene expression; Genomics; Humans; Mathematics; Principal component analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
Type :
conf
DOI :
10.1109/CSB.2004.1332429
Filename :
1332429
Link To Document :
بازگشت