• DocumentCode
    491804
  • Title

    Basic Sequence Search by Hashing Algorithm in DNA Sequence Databases

  • Author

    Toh, Sing-Hui ; Lee, Hoon-Jae ; Do, Kyeong-Hoon

  • Author_Institution
    Dept. of Ubiquitous IT, Dongseo Univ., Busan
  • Volume
    03
  • fYear
    2009
  • fDate
    15-18 Feb. 2009
  • Firstpage
    2317
  • Lastpage
    2320
  • Abstract
    We designed a new genome search tool, basic sequence search by hashing algorithm (BSSHA) bases on basic local alignment search tool (BLAST) and, sequence search and alignment by hashing algorithm (SSAHA) for DNA sequence databases. Preprocess query sequence by making m-letter word list. Sequences in the database are preprocessed by breaking them into k-tuples of k-contiguous bases and hash table is created to store the position of each occurrence of each k-tuple. "Linked list" method is applied to search a query sequence in the database by detecting the hits in the hash table. Hash table for database is generated only once since it can be saved in the memory for further use. BSSHA depends on exact matching pair, so it enables execution time shorter by skipping those empty hit list or low possibility matches. Time complexity theory shows that the execution time for BSSHA is shorter than SSAHA.
  • Keywords
    DNA; biology computing; computational complexity; database management systems; genetics; query formulation; sequences; BLAST; DNA sequence databases; SSAHA; basic local alignment search tool; basic sequence search; genome search tool; hash table; k-contiguous bases; k-tuples; linked list method; query sequence; sequence search and alignment by hashing algorithm; time complexity; Algorithm design and analysis; Bioinformatics; Complexity theory; DNA computing; Data engineering; Design engineering; Genomics; Pervasive computing; Sequences; Spatial databases; Query sequence; hash table; matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Communication Technology, 2009. ICACT 2009. 11th International Conference on
  • Conference_Location
    Phoenix Park
  • ISSN
    1738-9445
  • Print_ISBN
    978-89-5519-138-7
  • Electronic_ISBN
    1738-9445
  • Type

    conf

  • Filename
    4809542