Title :
Biclustering Gene Expression Profiles by Alternately Sorting with Weighted Correlated Coefficient
Author :
Teng, Li ; Chan, Lai-Wan
Author_Institution :
Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Hong Kong
Abstract :
This paper proposes a framework for biclustering gene expression profiles. The framework applies the dominant set approach to create sets of sorting vectors. With these sorting vectors, we iteratively sort and transpose the gene expression data. A weighted correlation coefficient is used to measure similarity at the gene level and at the condition level. The weights are assigned according to the similarity measures at the previous level, and we refine and update them in each iteration. This enables us to concentrate on measuring the similarity of relevant features during the biclustering process, so that a highly correlated bicluster can be located easily. We have applied this biclustering approach to three real gene expression data sets and found the results very encouraging. In addition, we propose the average correlation value (ACV), a criterion for evaluating a bicluster. This criterion has been compared with the mean squared residue score, and ACV is found to be more appropriate.
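The two similarity notions named in the abstract can be illustrated with a short sketch. It is not the authors' implementation: the abstract gives no formulas, so `weighted_corr` below is the standard weighted Pearson correlation, and `average_abs_correlation` is only one plausible reading of an "average correlation value" (the mean absolute pairwise correlation among rows or among columns, whichever is larger). Both function names and the choice of NumPy are assumptions made for illustration.

```python
import numpy as np


def weighted_corr(x, y, w):
    """Standard weighted Pearson correlation of x and y under weights w.

    Assumption: the abstract does not say how weights enter the
    coefficient; this is the textbook weighted form.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    w = w / w.sum()                      # normalise weights to sum to 1
    dx = x - np.sum(w * x)               # deviations from weighted means
    dy = y - np.sum(w * y)
    denom = np.sqrt(np.sum(w * dx * dx) * np.sum(w * dy * dy))
    return float(np.sum(w * dx * dy) / denom) if denom > 0 else 0.0


def average_abs_correlation(block):
    """Hypothetical ACV-like score for a submatrix (rows x columns).

    Assumption: mean absolute off-diagonal Pearson correlation, computed
    over rows and over columns, returning the larger of the two.
    """
    block = np.asarray(block, dtype=float)

    def avg(m):
        n = m.shape[0]
        if n < 2:
            return 0.0
        r = np.abs(np.nan_to_num(np.corrcoef(m)))  # constant rows give NaN
        return float((r.sum() - np.trace(r)) / (n * n - n))

    return max(avg(block), avg(block.T))
```

Setting a weight to zero removes that feature from the comparison, which is the sense in which weighting lets the similarity measure concentrate on relevant features. For instance, `weighted_corr(gene_a, gene_b, condition_weights)` compares two expression profiles using only the conditions that carry weight; `gene_a`, `gene_b`, and `condition_weights` are hypothetical names.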
Keywords :
biology computing; genetics; pattern clustering; sorting; average correlation value; correlated bicluster; dominant set; gene expression data biclustering; gene expression profile; sorting vector; weighted correlated coefficient; Biomedical measurements; Clustering algorithms; Computer science; Data analysis; Gene expression; Iterative algorithms; Iterative methods; Parameter estimation; Search methods; Sorting;
Conference_Title :
Machine Learning for Signal Processing, 2006. Proceedings of the 2006 16th IEEE Signal Processing Society Workshop on
Conference_Location :
Arlington, VA
Print_ISBN :
1-4244-0656-0
Electronic_ISBN :
1551-2541
DOI :
10.1109/MLSP.2006.275563