Title :
Discovering phenotype specific gene module using a novel biclustering algorithm in colorectal cancer
Author :
Jungrim Kim ; Jeagyoon Ahn ; Youngmi Yoon ; Yunku Yeu ; Sanghyun Park
Author_Institution :
Dept. of Comput. Sci., Yonsei Univ., Seoul, South Korea
Abstract :
Gene clustering is a method for finding gene sets that are related to the same biological process or molecular function. To find such gene sets, previous studies have clustered genes that show similar mRNA expression, or a specific expression pattern, in a set or subset of samples. However, given two contrasting groups of samples, current gene clustering methods cannot easily identify gene sets that show a significant expression pattern in only one group. Existing biclustering methods use only one group of samples (the disease group), so although they can find biclusters with a specific expression pattern, they have difficulty identifying disease-specific biclusters, that is, biclusters that are differentially expressed in the disease. Here, we propose a novel method that applies a genetic algorithm to gene expression data in order to find gene sets that can represent a specific subtype of cancer. The proposed method finds gene sets whose mRNA expression differs statistically between the two contrasting sample groups in a fraction of the cancer samples. The resulting gene modules share a higher number of GO (Gene Ontology) terms related to a specific disease than gene modules identified by current algorithms. We also show that, when protein-protein interaction data are integrated with gene expression data from colorectal cancer samples, the proposed method finds more functionally related gene sets.
Keywords :
RNA; biochemistry; bioinformatics; biological organs; cancer; data mining; genetic algorithms; genetics; medical computing; molecular biophysics; molecular configurations; ontologies (artificial intelligence); pattern clustering; pattern matching; proteins; statistical analysis; text analysis; GO term; biclustering algorithm; biological process; cancer subtype; colorectal cancer; differential expression; disease related gene; disease specific bicluster identification; functionally related gene set; gene clustering; gene expression data; gene expression pattern similarity; gene module identification; gene ontology term; gene set finding; gene set identification; gene similarity; genetic algorithm; mRNA expression similarity; molecular function; phenotype specific gene module discovery; protein-protein interaction data integration; statistically differential mRNA expression; Algorithm design and analysis; Bioinformatics; Cancer; Diseases; Gene expression; Genetic algorithms; Proteins; Biclustering; Gene module; Genetic Algorithm; Microarray;
Conference_Title :
2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Conference_Location :
Belfast
DOI :
10.1109/BIBM.2014.6999154