  • DocumentCode
    2582850
  • Title
    Discovery of repetitive patterns in DNA with accurate boundaries
  • Author
    Zheng, Jie; Lonardi, Stefano
  • Author_Institution
    Dept. of Computer Science & Engineering, University of California, Riverside, CA, USA
  • fYear
    2005
  • fDate
    19-21 Oct. 2005
  • Firstpage
    105
  • Lastpage
    112
  • Abstract
    The accurate identification of repeats remains a challenging open problem in bioinformatics. Most existing methods of repeat identification either depend on annotated repeat databases or restrict repeats to pairs of similar sequences that are maximal in length. The fundamental flaw in most of the available methods is the lack of a definition that correctly balances the importance of the length and the frequency. In this paper, we propose a new definition of repeats that satisfies both criteria. We give a novel characterization of the building blocks of repeats, called elementary repeats, which leads to a natural definition of repeat boundaries. We design efficient algorithms and test them on synthetic and real biological data. Experimental results show that our method is highly accurate.
  • Keywords
    DNA; biology computing; molecular biophysics; molecular configurations; DNA sequences; accurate boundaries; annotated repeat databases; bioinformatics; elementary repeats; repeat identification; repetitive DNA patterns; restrict repeats; Bioinformatics; Biological information theory; DNA; Diseases; Frequency; Genetics; Genomics; Humans; Libraries; Sequences
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
  • Print_ISBN
    0-7695-2476-1
  • Type
    conf
  • DOI
    10.1109/BIBE.2005.23
  • Filename
    1544455