Title :
Maximum Likelihood Quantization of Genomic Features Using Dynamic Programming
Author :
Mingzhou Song ; Haralick, R.M. ; Boissinot, S.
Author_Institution :
New Mexico State Univ., Las Cruces
Abstract :
Dynamic programming is introduced to quantize a continuous random variable into a discrete random variable. Quantization is often useful before statistical analysis or reconstruction of large network models among multiple random variables. The quantization, through dynamic programming, finds the optimal discrete representation of the original probability density function of a random variable by maximizing the likelihood for the observed data. This algorithm is highly applicable to study genomic features such as the recombination rate across the chromosomes and the statistical properties of non-coding elements such as LINE1. In particular, the recombination rate obtained by quantization is studied for LINE1 elements that are grouped also using quantization by length. The exact and density-preserving quantization approach provides an alternative superior to the inexact and distance-based k-means clustering algorithm for discretization of a single variable.
Keywords :
dynamic programming; maximum likelihood estimation; probability; quantisation (signal); random processes; chromosomes; continuous random variable; discrete random variable; distance-based k-means clustering algorithm; dynamic programming; genomic features; maximum likelihood quantization; noncoding elements; optimal discrete representation; probability density function; recombination rate; statistical property; Bioinformatics; Biological cells; Computer science; Dynamic programming; Entropy; Genomics; Graphical models; Probability density function; Quantization; Random variables;
Conference_Titel :
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-0-7695-3069-7
DOI :
10.1109/ICMLA.2007.36