DocumentCode
2582850
Title
Discovery of repetitive patterns in DNA with accurate boundaries
Author
Zheng, Jie ; Lonardi, Stefano
Author_Institution
Dept. of Comput. Sci. & Eng., California Univ., Riverside, CA, USA
fYear
2005
fDate
19-21 Oct. 2005
Firstpage
105
Lastpage
112
Abstract
The accurate identification of repeats remains a challenging open problem in bioinformatics. Most existing methods of repeat identification either depend on annotated repeat databases or restrict repeats to pairs of similar sequences that are maximal in length. The fundamental flaw in most of the available methods is the lack of a definition that correctly balances the importance of the length and the frequency. In this paper, we propose a new definition of repeats that satisfies both criteria. We give a novel characterization of the building blocks of repeats, called elementary repeats, which leads to a natural definition of repeat boundaries. We design efficient algorithms and test them on synthetic and real biological data. Experimental results show that our method is highly accurate.
Keywords
DNA; biology computing; molecular biophysics; molecular configurations; DNA sequences; accurate boundaries; annotated repeat databases; bioinformatics; elementary repeats; repeat identification; repetitive DNA patterns; restrict repeats; Bioinformatics; Biological information theory; DNA; Diseases; Frequency; Genetics; Genomics; Humans; Libraries; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN
0-7695-2476-1
Type
conf
DOI
10.1109/BIBE.2005.23
Filename
1544455
Link To Document