• DocumentCode
    1653465
  • Title

    A Fast Pattern Matching Algorithm for Biological Sequences

  • Author

    Huang, Yong ; Pan, Xuezeng ; Gao, Yunjun ; Cai, Guoyong

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Univ. Hangzhou, Hangzhou
  • fYear
    2008
  • Firstpage
    608
  • Lastpage
    611
  • Abstract
    With the remarkable increase in the number of DNA and proteins sequences, it is more important for the study of pattern matching in querying sequence patterns in the biological sequence database. To further raise the performance of the pattern matching algorithm, a fast exact algorithm (called ZTBMH), which is a variation of Zhu-Takaoka algorithm, is presented. It absorbs the idea of Boyer-Moore-Horspool algorithm, which utilizes only bad character heuristic and reduces the number of comparisons, thus improves the performance in practice. The best, worst and average cases in time complexities of the new algorithm are also discussed in this paper. The experimental results show that the proposed algorithm works better than other compared algorithms, especially in case of small alphabets such as nucleotides sequences, and thus the proposed algorithm is quite applicable for exact pattern matching in biological sequences.
  • Keywords
    DNA; biological techniques; biology computing; molecular biophysics; proteins; Boyer-Moore-Horspool algorithm; DNA sequences; ZTBMH; Zhu-Takaoka algorithm; biological sequences; fast exact algorithm; fast pattern matching algorithm; nucleotide sequences; protein sequences; Biology; Computer science; DNA; Databases; Educational institutions; History; Paper technology; Pattern matching; Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008. The 2nd International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-1747-6
  • Electronic_ISBN
    978-1-4244-1748-3
  • Type

    conf

  • DOI
    10.1109/ICBBE.2008.148
  • Filename
    4535028