Title :
Biclustering of Gene Expression Data Using Genetic Algorithm
Author :
Chakraborty, Anupam ; Maka, Hitashyam
Author_Institution :
Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur-721302, India, Email: anupamc@iitkgp.ernet.in
Abstract :
The biclustering problem of gene expression data deals with finding a subset of genes which exhibit similar expression patterns along a subset of conditions. Most of the current algorithms use a statistically predefined threshold as an input parameter for biclustering. This threshold defines the maximum allowable dissimilarity between the cells of a bicluster and is very hard to determine beforehand. Hence we propose two genetic algorithms that embed greedy algorithm as local search procedure and find the best biclusters independent of this threshold score. We also establish that the HScore of a bicluster under the additive model approximately follows chi-square distribution. We found that these genetic algorithms outperformed other greedy algorithms on yeast and lymphoma datasets.
Keywords :
Clustering algorithms; DNA; Data analysis; Evolutionary computation; Fungi; Gene expression; Genetic algorithms; Greedy algorithms; Interference; Iterative algorithms;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
Print_ISBN :
0-7803-9387-2
DOI :
10.1109/CIBCB.2005.1594893