Title :
RNA Search Acceleration with Genetic Algorithm Generated Decision Trees
Abstract :
The computing time requirements for using covariance models to search for non-coding RNA genes in genomic data make direct use of these models for RNA search impractical for RNA families with long consensus sequences. A possible solution is to break family models into portions and apply these portions sequentially on a segment of database until either an acceptance or rejection can be obtained on the given database segment. As will be shown, the application order and accept/reject decisions can be rather complex. A decision tree framework with genetic algorithms used to find the family-specific decision trees is proposed in this work to handle these complex cases. Computational speed-ups for database search of more than a factor of 15 have been observed using these methods as presented in the experimental section of this work.
Keywords :
decision trees; genetic algorithms; genomics; RNA; decision trees; genetic algorithm; genomic data; Acceleration; Bioinformatics; Computational complexity; Databases; Decision trees; Genetic algorithms; Genomics; Hidden Markov models; RNA; Shape; bioinformatics; covariance models; non-coding RNA;
Conference_Titel :
Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-0-7695-3495-4
DOI :
10.1109/ICMLA.2008.77