Title of article
Rényi continuous entropy of DNA sequences
Author/Authors
D. Vinga Szabo ، نويسنده , , Susana and Almeida، نويسنده , , Jonas S.، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2004
Pages
12
From page
377
To page
388
Abstract
Entropy measures of DNA sequences estimate their randomness or, inversely, their repeatability. L -block Shannon discrete entropy accounts for the empirical distribution of all length- L words and has convergence problems for finite sequences. A new entropy measure that extends Shannonʹs formalism is proposed. Rényiʹs quadratic entropy calculated with Parzen window density estimation method applied to CGR/USM continuous maps of DNA sequences constitute a novel technique to evaluate sequence global randomness without some of the former method drawbacks. The asymptotic behaviour of this new measure was analytically deduced and the calculation of entropies for several synthetic and experimental biological sequences was performed. The results obtained were compared with the distributions of the null model of randomness obtained by simulation. The biological sequences have shown a different p -value according to the kernel resolution of Parzenʹs method, which might indicate an unknown level of organization of their patterns. This new technique can be very useful in the study of DNA sequence complexity and provide additional tools for DNA entropy estimation. The main MATLAB applications developed and additional material are available at the webpage http://bioinformatics.musc.edu/renyi. Specialized functions can be obtained from the authors.
Keywords
CGR/USM , Rényi entropy , Information theory , Parzenיs method , DNA
Journal title
Journal of Theoretical Biology
Serial Year
2004
Journal title
Journal of Theoretical Biology
Record number
1536706
Link To Document