Title :
Joint base-calling of two DNA sequences with factor graphs
Author :
Shi, Xiaomeng ; Lun, Desmond S. ; Meldrim, Jim ; Kötter, Ralf ; Médard, Muriel
Author_Institution :
Lab. for Inf. & Decision Syst., Massachusetts Inst. of Technol., Cambridge, MA
Date :
March 31, 2008 - April 4, 2008
Abstract :
To improve the utility of existing technologies based on Sanger sequencing, this paper examines the possibility of base-calling two superposed DNA sequences jointly. This approach allows a single electrophoresis experiment to process two sequences, using the same quantity of reagents and machine hours as for a single sequence. A practical heuristic is proposed that first estimates the peak parameters and then separates the peaks into two sequences (major and minor) by passing messages on a factor graph. Base-calling on the major sequence alone yields accuracy commensurate with single-sequence approaches, and joint base-calling provides results for the minor sequence which, while of lesser quality, incur no additional cost and can ultimately be used in the genome assembly process.
Keywords :
DNA; genetics; graph theory; maximum likelihood estimation; optimisation; DNA sequences; Sanger sequencing; factor graphs; genome assembly process; joint base-calling; maximum a posteriori base estimation; peak parameter estimation; practical heuristic; single electrophoresis experiment; Assembly; Bioinformatics; Chemicals; Costs; Electrokinetics; Genomics; Intersymbol interference; Parameter estimation; Sequences; DNA sequencing
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518043