مرکز منطقه ای اطلاع رساني علوم و فناوري - A Suffix Tree Construction Algorithm for DNA Sequences

DocumentCode :

1991984

Title :

A Suffix Tree Construction Algorithm for DNA Sequences

Author :

Huo, Hongwei ; Stojkovic, Vojislav

Author_Institution :

School of Computer Science and Technology, Xidian University, Xi´an 710071, China. hwhuo@mail.xidian.edu.cn

fYear :

2007

fDate :

14-17 Oct. 2007

Firstpage :

1178

Lastpage :

1182

Abstract :

The suffix tree is a powerful data structure in string processing and DNA sequence comparisons. However, constructing suffix trees being very greedy in space is a fatal drawback. In addition, the performance of the suffix tree construction using suffix link will rapidly degrade with the increase of the scale of sequences to be handled because of the random access. In order to overcome these disadvantages, a new bit layout is used for the nodes of a suffix tree which has less space requirements. Based on this an algorithm to construct suffix tree for DNA sequences is proposed using partitioning strategies. The effectiveness for the proposed algorithm is shown in the testing cases from NCBI web site. Comparisons with Kurtz´s algorithm in space requirements and running time have been made in the experiments. The results show that the proposed algorithm is memory-efficient and has a better performance over Kurtz´s algorithm on the average running time.

Keywords :

DNA; biology computing; tree data structures; DNA sequences; Kurtz algorithm; NCBI web site; bit layout; string processing; suffix tree construction algorithm; Binary search trees; Bioinformatics; Computer science; DNA; Degradation; Genomics; Partitioning algorithms; Performance loss; Sequences; Tree data structures;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on

Conference_Location :

Boston, MA

Print_ISBN :

978-1-4244-1509-0

Type :

conf

DOI :

10.1109/BIBE.2007.4375711

Filename :

4375711

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1991984