DocumentCode
475858
Title
Parallel Implementation of the Novel Approach to Genome Assembly
Author
Blazewicz, Jacek ; Kasprzak, Marta ; Swiercz, Aleksandra ; Figlerowicz, Marek ; Gawron, Piotr ; Platt, Darren ; Szajkowski, Lukasz
Author_Institution
Inst. of Comput. Sci., Poznan Univ. of Technol., Poznan
fYear
2008
fDate
6-8 Aug. 2008
Firstpage
732
Lastpage
737
Abstract
The DNA assembly problem is well known for its high complexity at both the biological and computational levels. The traditional laboratory approach to the problem, which involves DNA sequencing by hybridization or by gel electrophoresis, introduces many errors at the experimental and algorithmic stages. The DNA sequences that constitute the traditional assembly input are a few hundred nucleotides long and cover each other rather sparsely. A new biochemical approach to DNA sequencing gives highly reliable output at relatively low cost and in a short time. The approach is 454 sequencing, based on the pyrosequencing protocol and owned by 454 Life Sciences Corporation. The sequences it produces are shorter (about 100-200 nucleotides), but their coverage of the assembled sequence is very dense. In this paper, we propose a parallel implementation of an algorithm that deals well with such data and outperforms other assembly algorithms used in practice. The algorithm is a heuristic based on a graph model, with the graph built on the set of input sequences. Computational tests were performed on real data obtained from the 454 sequencer while sequencing the genome of the bacterium Prochlorococcus marinus.
Keywords
DNA; biology computing; graph theory; optimisation; parallel algorithms; DNA assembly problem; DNA sequencing; bacteria Prochlorococcus marinus; biochemical approach; gel electrophoresis; genome assembly; genome sequencing; graph model; heuristic; nucleotides; parallel implementation; pyrosequencing protocol; traditional laboratory approach; Assembly; Bioinformatics; Costs; DNA computing; Electrokinetics; Genomics; Laboratories; Protocols; Sequences; 454 sequencing; DNA assembly; graphs; heuristics
fLanguage
English
Publisher
IEEE
Conference_Titel
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on
Conference_Location
Phuket
Print_ISBN
978-0-7695-3263-9
Type
conf
DOI
10.1109/SNPD.2008.47
Filename
4617459