DocumentCode :
667271
Title :
Identification and correction of substitution errors in Moleculo long reads
Author :
Price, Jack ; Ward, Jamie ; Udall, Joshua ; Snell, Quinn ; Clement, M.
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT, USA
fYear :
2013
fDate :
10-13 Nov. 2013
Firstpage :
1
Lastpage :
4
Abstract :
Moleculo DNA sequencing technology provides extremely accurate, phased reads with an average length of over 4,000 bp. Very little is yet known about the precise characteristics of these reads. We estimate a lower bound for the single-nucleotide substitution error rate of these reads and provide probabilities for each type of substitution. We also present preliminary work on an error correction algorithm for these reads, which in its current implementation corrects 74,030 single-nucleotide errors in a Moleculo data set obtained from Rubus idaeus 'Heritage'. We also demonstrate that the pattern of substitution errors shows no significant bias with respect to the position of an error along the body of a read.
Keywords :
DNA; bioinformatics; Rubus idaeus Heritage; error correction algorithm; Moleculo DNA sequencing technology; Moleculo long reads; single nucleotide substitution error rate; substitution error correction; substitution error identification; substitution error pattern; Accuracy; Assembly; Bioinformatics; Error analysis; Error correction; Genomics; Sequential analysis
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
Conference_Location :
Chania
Type :
conf
DOI :
10.1109/BIBE.2013.6701609
Filename :
6701609