DocumentCode
2415315
Title
A Graphical Model Formulation of the DNA Base-Calling Problem
Author
Andrade-Cetto, Lucio ; Manolakos, Elias S.
Author_Institution
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA
fYear
2005
fDate
28-28 Sept. 2005
Firstpage
369
Lastpage
374
Abstract
A first order variable dependence (FOVD) probabilistic graphical model is introduced to capture the complex inter-event dependencies that are present in DNA sequencing data. In this framework, DNA base-calling is addressed as a parameter estimation problem using maximum likelihood methods. The FOVD model accounts for dependencies between neighboring alleles and statistically characterizes the size of signal peaks. Our experimental results suggest that the resulting unsupervised classification base-calling algorithms (i) achieve accuracy that exceeds on average that of the-state-of-the art base-callers, (ii) work well for a variety of data set types without requiring costly recalibration
Keywords
DNA; biology computing; maximum likelihood estimation; medical signal processing; molecular biophysics; scientific information systems; DNA base-calling; DNA sequencing data; first order variable dependence; interevent dependency; maximum likelihood method; parameter estimation; probabilistic graphical model; unsupervised classification base-calling; DNA computing; Data engineering; Data mining; Digital signal processing; Graphical models; Maximum likelihood detection; Maximum likelihood estimation; Parameter estimation; Random variables; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning for Signal Processing, 2005 IEEE Workshop on
Conference_Location
Mystic, CT
Print_ISBN
0-7803-9517-4
Type
conf
DOI
10.1109/MLSP.2005.1532931
Filename
1532931
Link To Document