Title :
Upstream Sequence Finder-Tool to Find Out Upstream Element in Various Database or Genome.
Author :
Jha, Vineet ; Mazumder, Mohit ; Roy, Susanta
Author_Institution :
Bio Explore
Abstract :
Upstream elements are very significant in disclosing the property of the sequence not only they set a signal for the various protein to bind there but also help in locating hidden sequences and their property like TATA box. The whole idea about developing this algorithm is that to find out upstream sequences which carry hidden property like road signs which can alert drivers. In this case protein help user to predict and analyse the upstream sequences. We downloaded the database file (nucleotide file), query file and did the nBLAST. Then we parse the blast output to filter out full length sequences (sequences which are not truncated either from 5´ or 3´ end for more than 11 bases). The time complexity of algorithm was improved from exponential time complex to linear time complex by using the divide and conquer approach, where the large database file is divided into smaller files. This algorithm gives good hits and filters out the upstream element. One can even fix the option of having a gap or un-gapped alignment in the database.
Keywords :
biology computing; computational complexity; file organisation; genomics; molecular biophysics; very large databases; TATA box; hidden sequences location; large database file; linear time complex; nBLAST; query file; time complexity; upstream sequence finder-tool; Biochemistry; Bioinformatics; DNA; Databases; Filters; Genomics; Polymers; Proteins; RNA; Sequences;
Conference_Titel :
Advance Computing Conference, 2009. IACC 2009. IEEE International
Conference_Location :
Patiala
Print_ISBN :
978-1-4244-2927-1
Electronic_ISBN :
978-1-4244-2928-8
DOI :
10.1109/IADCC.2009.4809210