DocumentCode
2736778
Title
An improved maximum likelihood formulation for accurate genome assembly
Author
Varma, Aditya ; Ranade, Abhiram ; Aluru, Srinivas
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Bombay, Mumbai, India
fYear
2011
fDate
3-5 Feb. 2011
Firstpage
165
Lastpage
170
Abstract
We present improvements to the recently proposed maximum likelihood method for genome assembly. We formulate the problem as one of direct convex optimization and achieve the following improvements. First, our method does not require identical read lengths and can handle reads of varying lengths. Second, we eliminate the need to know a stringent estimate of the genome length a priori, or to use further expectation maximization to predict the most likely length. Instead, we explicitly incorporate the uncertainty in the length estimate through a range parameter, without affecting the convexity required for the optimization. Results indicate that our method generates accurate estimates of repeat counts and produces fewer and much longer contigs. These results mark a further advancement of maximum likelihood genome assembly and demonstrate the potential of this approach for building future genome assemblers.
Keywords
biology computing; genomics; molecular biophysics; optimisation; direct convex optimization; genome assembly; Assembly; Bioinformatics; DNA; Equations; Mathematical model; Maximum likelihood detection; maximum likelihood; next-gen sequencing
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location
Orlando, FL
Print_ISBN
978-1-61284-851-8
Type
conf
DOI
10.1109/ICCABS.2011.5729873
Filename
5729873