DocumentCode
3460425
Title
An Adaptive Suffix Tree Based Algorithm for Repeats Recognition in a DNA Sequence
Author
Huo, Hongwei ; Wang, Xiaowu ; Stojkovic, Vojislav
Author_Institution
Sch. of Comput. Sci. & Tech., Xidian Univ., Xi'an, China
fYear
2009
fDate
3-5 Aug. 2009
Firstpage
181
Lastpage
184
Abstract
Many methods for repeats recognition are based on alignments, and their running time significantly limits their applications. This paper presents RepSeeker (Rep(eats)Seeker), a fast algorithm for repeats recognition based on an adaptive version of the Ukkonen algorithm for suffix tree construction. RepSeeker uses the lowest frequency limit to maximize the extension of repeats. The Ukkonen suffix tree construction is adapted to make RepSeeker more efficient: the node information that RepSeeker requires is added while the suffix tree is being built. Because leaves and branch nodes hold different information, RepSeeker reads what it needs directly from the nodes to determine the frequency of a substring and to locate its positions. Comparisons of the suffix tree construction before and after these improvements show that they greatly reduce the running time of RepSeeker without loss of accuracy.
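The idea of storing different information in leaves and branch nodes can be illustrated with a minimal sketch. It is not the RepSeeker implementation and it does not use Ukkonen's linear-time construction: for brevity it builds a naive quadratic-time suffix trie. The `Node` layout and the names `build_suffix_trie`, `frequency`, and `positions` are assumptions made for illustration only. Each node keeps a count of the suffixes that pass through it, and each leaf keeps the start position of its suffix, so both the frequency and the positions of a substring can be read from the nodes.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    # Hypothetical node layout (an assumption, not taken from the paper):
    # every node counts the suffixes passing through it, and only
    # leaves record the start position of their suffix.
    children: dict = field(default_factory=dict)
    leaf_count: int = 0
    suffix_start: int = -1  # stays -1 on branch nodes


def build_suffix_trie(text: str) -> Node:
    """Naive O(n^2) suffix trie, a stand-in for a Ukkonen-built suffix tree."""
    text += "$"  # unique terminator so that every suffix ends in its own leaf
    root = Node()
    for start in range(len(text)):
        node = root
        node.leaf_count += 1
        for ch in text[start:]:
            node = node.children.setdefault(ch, Node())
            node.leaf_count += 1  # count is filled in during construction
        node.suffix_start = start
    return root


def _descend(root: Node, pattern: str):
    """Follow pattern from the root; return the node reached, or None."""
    node = root
    for ch in pattern:
        node = node.children.get(ch)
        if node is None:
            return None
    return node


def frequency(root: Node, pattern: str) -> int:
    """Number of occurrences, read directly from the stored count."""
    node = _descend(root, pattern)
    return node.leaf_count if node else 0


def positions(root: Node, pattern: str) -> list:
    """Start positions, collected from the leaves below the matched node."""
    node = _descend(root, pattern)
    if node is None:
        return []
    found, stack = [], [node]
    while stack:
        current = stack.pop()
        if current.suffix_start >= 0:
            found.append(current.suffix_start)
        stack.extend(current.children.values())
    return sorted(found)


if __name__ == "__main__":
    trie = build_suffix_trie("ACGACGT")
    print(frequency(trie, "ACG"), positions(trie, "ACG"))
```

Storing the count at construction time means a frequency query costs only the descent along the pattern, while positions are gathered from leaves only when they are actually needed.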
Keywords
DNA; bioinformatics; pattern recognition; trees (mathematics); DNA sequence; RepSeeker algorithm; Ukkonen suffix tree construction; adaptive Ukkonen algorithm; adaptive suffix tree based algorithm; bioinformatics; repeats recognition; Bioinformatics; Computer science; DNA; Diseases; Frequency; Genomics; Humans; Libraries; Sequences; Systems biology; RepSeeker algorithm; Repeats recognition; Ukkonen algorithm; adaptive suffix tree;
fLanguage
English
Publisher
IEEE
Conference_Titel
Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3739-9
Type
conf
DOI
10.1109/IJCBS.2009.65
Filename
5260704