Title :
A New Two-steps Gene Expression Data Clustering Method
Author :
YanJie Zhang ; Prinet, Veronique ; Shuanhu Wu
Author_Institution :
Sch. of Comput. Sci. & Technol., Yantai Universiy Nat., Yantai, China
Abstract :
Gene expression data analysis is very important for the research on gene regulatory mechanisms. Genes which exhibit similar patterns are often functionally related. In this paper a novel bicluster detection method is proposed. Its advantage lies in it can not only make use of the traditional data clustering methods, but also form a systemic architecture. The whole processing procedure can be divided into two parts. The first is using one existing clustering method to cluster all the 2-combinations of the data matrix in the direction where the dimensionality is smaller, Then based on the clustering results some binary tables are created. The second part is to verify the concatenated quasi-bicluster. Since the data in the same bicluster is highly correlated with each other, a principal component analysis (PCA) based efficient verification method is applied, which can also work in noisy environment. The whole processing aims at finding all of the possible biclusters from large to small.
Keywords :
biology computing; data analysis; pattern clustering; principal component analysis; bicluster detection method; binary tables; data matrix; gene expression data analysis; gene regulatory mechanisms; principal component analysis; two-steps gene expression data clustering method; verification method; Automation; Clustering methods; Computer science; Concatenated codes; Data analysis; Fuzzy systems; Gene expression; Laboratories; Pattern recognition; Principal component analysis; Bicluster; Principal Component Analysis;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.481