DocumentCode :
3477400
Title :
ConPath: Scaffold Analysis Tool Using Mate-Pair Information for Genome Sequencing
Author :
Kim, Pan-Gyu ; Cho, Hwan-Gue ; Park, Kiejung
Author_Institution :
SmallSoft Co. Ltd., Daejeon
fYear :
2007
fDate :
11-13 Oct. 2007
Firstpage :
55
Lastpage :
59
Abstract :
We have developed a Windows-based program, ConPath, as a scaffold analyzer which constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find out the longest acyclic graphs. Using end-read pairs of fixed-sized mate-pair library, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pairs, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including M. succiniciproducens [12] and V.vulinificus, where we have verified contig assembly by finding out some erroneous contigs with four kinds of error types defined in ConPath. It also supports some convenient features and viewers to investigate each contig in detail, like contig viewer, scaffold viewer, edge information list, mate-pair list, printing complex scaffolds structures, and so on.
Keywords :
DNA; biology computing; cellular biophysics; microorganisms; molecular biophysics; tissue engineering; ConPath; M. succiniciproducens; V.vulinificus; Windows-based program; contig algorithm; edge information list; end-read pairs; fixed-sized mate-pair library; genome sequencing; mate-pair information; microbial genome projects; printing complex scaffold structures; scaffold analysis tool; separate sequence contig ordering; separate sequence contig orienting; Assembly; Bioinformatics; DNA; Genomics; Information analysis; Information technology; Joining processes; Libraries; Sequences; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in the Convergence of Bioscience and Information Technologies, 2007. FBIT 2007
Conference_Location :
Jeju City
Print_ISBN :
978-0-7695-2999-8
Type :
conf
DOI :
10.1109/FBIT.2007.119
Filename :
4524079
Link To Document :
بازگشت