• DocumentCode
    2736778
  • Title

    An improved maximum likelihood formulation for accurate genome assembly

  • Author

    Varma, Aditya ; Ranade, Abhiram ; Aluru, Srinivas

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Bombay, Mumbai, India
  • fYear
    2011
  • fDate
    3-5 Feb. 2011
  • Firstpage
    165
  • Lastpage
    170
  • Abstract
    We present improvements to the recently proposed maximum likelihood method for genome assembly. We formulate the problem as one of direct convex optimization, and achieve the following improvements: Our method does not require identical read lengths and can deal with reads of varying lengths. We eliminate the requirement to a priori know a stringent estimate of the length of the genome or the need to use further expectation minimization to predict the most likely length. Instead, we explicitly incorporate the uncertainty in the length estimate by a range parameter without affecting the convexity required for the optimization. Results indicate that our method can generate accurate estimates of repeat counts and produces fewer and much longer contigs. These results mark a further advancement of maximum likelihood genome assembly and the potential of this approach in building future genome assemblers.
  • Keywords
    biology computing; genomics; molecular biophysics; optimisation; direct convex optimization; genome assembly; Assembly; Bioinformatics; DNA; Equations; Genomics; Mathematical model; Maximum likelihood detection; genome assembly; maximum likelihood; next-gen sequencing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    978-1-61284-851-8
  • Type

    conf

  • DOI
    10.1109/ICCABS.2011.5729873
  • Filename
    5729873