• DocumentCode
    2415315
  • Title

    A Graphical Model Formulation of the DNA Base-Calling Problem

  • Author

    Andrade-Cetto, Lucio ; Manolakos, Elias S.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA
  • fYear
    2005
  • fDate
    28-28 Sept. 2005
  • Firstpage
    369
  • Lastpage
    374
  • Abstract
    A first order variable dependence (FOVD) probabilistic graphical model is introduced to capture the complex inter-event dependencies that are present in DNA sequencing data. In this framework, DNA base-calling is addressed as a parameter estimation problem using maximum likelihood methods. The FOVD model accounts for dependencies between neighboring alleles and statistically characterizes the size of signal peaks. Our experimental results suggest that the resulting unsupervised classification base-calling algorithms (i) achieve accuracy that exceeds on average that of the-state-of-the art base-callers, (ii) work well for a variety of data set types without requiring costly recalibration
  • Keywords
    DNA; biology computing; maximum likelihood estimation; medical signal processing; molecular biophysics; scientific information systems; DNA base-calling; DNA sequencing data; first order variable dependence; interevent dependency; maximum likelihood method; parameter estimation; probabilistic graphical model; unsupervised classification base-calling; DNA computing; Data engineering; Data mining; Digital signal processing; Graphical models; Maximum likelihood detection; Maximum likelihood estimation; Parameter estimation; Random variables; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2005 IEEE Workshop on
  • Conference_Location
    Mystic, CT
  • Print_ISBN
    0-7803-9517-4
  • Type

    conf

  • DOI
    10.1109/MLSP.2005.1532931
  • Filename
    1532931