Title :
A Graphical Model Formulation of the DNA Base-Calling Problem
Author :
Andrade-Cetto, Lucio ; Manolakos, Elias S.
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA
Abstract :
A first order variable dependence (FOVD) probabilistic graphical model is introduced to capture the complex inter-event dependencies that are present in DNA sequencing data. In this framework, DNA base-calling is addressed as a parameter estimation problem using maximum likelihood methods. The FOVD model accounts for dependencies between neighboring alleles and statistically characterizes the size of signal peaks. Our experimental results suggest that the resulting unsupervised classification base-calling algorithms (i) achieve accuracy that exceeds on average that of the-state-of-the art base-callers, (ii) work well for a variety of data set types without requiring costly recalibration
Keywords :
DNA; biology computing; maximum likelihood estimation; medical signal processing; molecular biophysics; scientific information systems; DNA base-calling; DNA sequencing data; first order variable dependence; interevent dependency; maximum likelihood method; parameter estimation; probabilistic graphical model; unsupervised classification base-calling; DNA computing; Data engineering; Data mining; Digital signal processing; Graphical models; Maximum likelihood detection; Maximum likelihood estimation; Parameter estimation; Random variables; Sequences;
Conference_Titel :
Machine Learning for Signal Processing, 2005 IEEE Workshop on
Conference_Location :
Mystic, CT
Print_ISBN :
0-7803-9517-4
DOI :
10.1109/MLSP.2005.1532931