DocumentCode :
2415315
Title :
A Graphical Model Formulation of the DNA Base-Calling Problem
Author :
Andrade-Cetto, Lucio ; Manolakos, Elias S.
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA
fYear :
2005
fDate :
28-28 Sept. 2005
Firstpage :
369
Lastpage :
374
Abstract :
A first order variable dependence (FOVD) probabilistic graphical model is introduced to capture the complex inter-event dependencies that are present in DNA sequencing data. In this framework, DNA base-calling is addressed as a parameter estimation problem using maximum likelihood methods. The FOVD model accounts for dependencies between neighboring alleles and statistically characterizes the size of signal peaks. Our experimental results suggest that the resulting unsupervised classification base-calling algorithms (i) achieve accuracy that exceeds on average that of the-state-of-the art base-callers, (ii) work well for a variety of data set types without requiring costly recalibration
Keywords :
DNA; biology computing; maximum likelihood estimation; medical signal processing; molecular biophysics; scientific information systems; DNA base-calling; DNA sequencing data; first order variable dependence; interevent dependency; maximum likelihood method; parameter estimation; probabilistic graphical model; unsupervised classification base-calling; DNA computing; Data engineering; Data mining; Digital signal processing; Graphical models; Maximum likelihood detection; Maximum likelihood estimation; Parameter estimation; Random variables; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing, 2005 IEEE Workshop on
Conference_Location :
Mystic, CT
Print_ISBN :
0-7803-9517-4
Type :
conf
DOI :
10.1109/MLSP.2005.1532931
Filename :
1532931
Link To Document :
بازگشت