DocumentCode :
1991256
Title :
An Intelligent System for Searching Genomic Sequences
Author :
Gurnmuluru, V. ; Chen, Su-Shing
Author_Institution :
Floirda Univ., Gainesville
fYear :
2007
fDate :
14-17 Oct. 2007
Firstpage :
982
Lastpage :
986
Abstract :
In this paper, we have developed an intelligent system for searching comparative genomic sequences which departs from the traditional sequence alignment methods of nucleic residues or alphabets. Instead, we use the composition vector method that exploits pattern structures in sequences and indexing techniques for building a genomic database of prokaryotic organisms and their phylogenetic relationships. For the structural analysis of prokaryotic patterns, we use this composition vector to express various fuzzy sequence pattern queries on genomic data that would be difficult to represent in traditional database technology. B.L. Hao and his group have used the composition vector method to construct a phylogenetic tree of prokaryotes to understand the evolutionary history of prokaryotic organisms. The composition vector method is based on counting the frequency of nucleotides of a fixed length K in the collection of gene sequences of each species. This method transforms variable length sequences to a fixed length vector. In addition to elaborating on the composition vector method, we also dwell on the sequence pattern queries, the implementation with its reasoning before we finally wrap up with a discussion which we are sure will kindle some more thoughts and views to progress this work.
Keywords :
biological techniques; cellular biophysics; genetic algorithms; genetics; intelligent networks; molecular biophysics; composition vector method; fuzzy sequence pattern queries; gene sequences; genomic database; genomic sequences; indexing; intelligent system; nucleotides; pattern structures; phylogenetic relationships; prokaryotes phylogenetic tree; prokaryotic organisms; prokaryotic patterns; Bioinformatics; Buildings; Databases; Genomics; History; Indexing; Intelligent systems; Organisms; Pattern analysis; Phylogeny; K-string; composition vector; frequency vector; sequence pattern queries;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
Type :
conf
DOI :
10.1109/BIBE.2007.4375677
Filename :
4375677
Link To Document :
بازگشت