Title :
Biclustering gene expression data by random projection based on bucketing
Author :
Liu, Juan ; Liu, Feng
Author_Institution :
Sch. of Comput., Wuhan Univ., Wuhan
Abstract :
It is an important task for biologists to analyze gene expression data with microarray technology development. Biclustering of gene expression data is the process of grouping a subset of genes over a subset of conditions into a class, in which each gene behaviors similarly over the selected conditions and each condition is related to a certain classification. In this paper, we present a random projection method to find the largest biclusters from gene expression data. To avoid sampling the column uniformly, we adopt the bucketing technology to estimate the probability to sample each column. Experiments show that our method can find the largest biclusters in simulation data and real data.
Keywords :
biology computing; genetics; pattern clustering; bucketing; gene expression data biclustering; microarray technology development; random projection method; Biomedical engineering; Clustering algorithms; Data analysis; Data engineering; Data mining; Gene expression; Information technology; Pattern recognition; Sampling methods; Testing;
Conference_Titel :
Information Technology and Applications in Biomedicine, 2008. ITAB 2008. International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4244-2254-8
Electronic_ISBN :
978-1-4244-2255-5
DOI :
10.1109/ITAB.2008.4570606