Title :
A parallel Euler approach for large-scale biological sequence assembly
Author :
Shi, Wei ; Zhou, Wanlei
Author_Institution :
Sch. of Inf. Technol., Deakin Univ., Burwood, Vic., Australia
Abstract :
Biological sequence assembly is an essential step for sequencing the genomes of organisms. Sequence assembly is very computing intensive especially for the large-scale sequence assembly. Parallel computing is an effective way to reduce the computing time and support the assembly for large amount of biological fragments. Euler sequence assembly algorithm is an innovative algorithm proposed recently. The advantage of this algorithm is that its computing complexity is polynomial and it provides a better solution to the notorious "repeat" problem. This paper introduces the parallelization of the Euler sequence assembly algorithm. Alt the Genome fragments generated by whole genome shotgun (WGS) will be assembled as a whole rather than dividing them into groups which may incurs errors due to the inaccurate group partition. The implemented system can be run on supercomputers, network of workstations or even network of PC computers. The experimental results have demonstrated the performance of our system.
Keywords :
biology computing; computational complexity; genetics; parallel algorithms; sequences; Euler sequence assembly; computing complexity; large-scale biological sequence assembly; parallel Euler approach; repeat problem; whole genome shotgun; Assembly; Bioinformatics; Biology computing; Concurrent computing; Genomics; Large-scale systems; Organisms; Parallel processing; Partitioning algorithms; Polynomials;
Conference_Titel :
Information Technology and Applications, 2005. ICITA 2005. Third International Conference on
Print_ISBN :
0-7695-2316-1
DOI :
10.1109/ICITA.2005.41