DocumentCode :
445854
Title :
Characterizing human gene splice sites using evolved regular expressions
Author :
Li, Jing-Jing ; Huang, De-Shuang ; MacCallum, Robert M. ; Wu, Xiao-Run
Author_Institution :
Intelligent Comput. Lab., Chinese Acad. of Sci., Hefei, China
Volume :
1
fYear :
2005
fDate :
31 July-4 Aug. 2005
Firstpage :
493
Abstract :
In this paper, an algorithm using evolved regular expressions to characterize and predict human gene splice sites without any prior knowledge is described. In contrast to previous pattern-based approaches to the splice site detection problem, the patterns to be matched are unknown in advance and discovered using a supervised learning approach. We have used a genetic programming based system, PerlGP, to evolve regular expressions and proper length windows for a long sequence in which the evolved regular expressions can effectively characterize and predict the splice junctions. Since the gene splicing process is too complex to be fully understood currently, and the widely accepted consensus sequences only reflect some partial statistical information around splice sites, not to mention defining a splice site. However, our evolved regular expressions may shed new light on the underlying rules that define splice sites. Experimental results demonstrate that using the evolved regular expressions, splice junctions could be accurately characterized, furthermore, these evolved regular expressions could also be employed as a predictor to detect whether a CT/AG containing sequence is a splice site or not. Our experimental results also exhibit that the performance of this approach for predicting human gene splice junctions is competitive compared with some other traditional methods.
Keywords :
biocomputing; genetic algorithms; genetics; learning (artificial intelligence); pattern classification; PerlGP; evolved regular expressions; genetic programming; human gene splice; supervised learning; Bioinformatics; DNA; Genomics; Humans; Machine intelligence; Organisms; Proteins; RNA; Sequences; Splicing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
Type :
conf
DOI :
10.1109/IJCNN.2005.1555880
Filename :
1555880
Link To Document :
بازگشت