DocumentCode :
3230537
Title :
FMGA: finding motifs by genetic algorithm
Author :
Liu, Falcon F M ; Tsai, Jeffrey J P ; Chen, R.M. ; Chen, S.N. ; Shih, S.H.
Author_Institution :
Inst. of Bioinformatics, Taichung Healthcare & Manage. Univ., Taiwan
fYear :
2004
fDate :
19-21 May 2004
Firstpage :
459
Lastpage :
466
Abstract :
In the era of post-genomics, almost all the genes have been sequenced and enormous amounts of data have been generated. Hence, to mine useful information from these data is a very important topic. In this paper we propose a new approach for finding potential motifs in the regions located from the -2000 bp upstream to +1000 bp downstream of transcription start site (TSS). This new approach is developed based on the genetic algorithm (GA). The mutation in the GA is performed by using position weight matrices to reserve the completely conserved positions. The crossover is implemented with special-designed gap penalties to produce the optimal child pattern. We also present a rearrangement method based on position weight matrices to avoid the presence of a very stable local minimum, which may make it quite difficult for the other operators to generate the optimal pattern. Our approach shows superior results by comparing with multiple em for motif elicitation (MEME) and Gibbs sampler, which are two popular algorithms for finding motifs.
Keywords :
biology computing; data mining; genetic algorithms; genetics; pattern recognition; sequences; FMGA; data mining; gene sequences; genetic algorithm; motifs; optimal child pattern; position weight matrices; transcription start site; Artificial neural networks; Bioinformatics; DNA; Genetic algorithms; Medical services; Polymers; Proteins; RNA; Sequences; Technology management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
Print_ISBN :
0-7695-2173-8
Type :
conf
DOI :
10.1109/BIBE.2004.1317378
Filename :
1317378
Link To Document :
بازگشت