Title of article :
Rényi continuous entropy of DNA sequences
Author/Authors :
D. Vinga Szabo, Susana and Almeida, Jonas S.
Issue Information :
Journal with serial issue number, year 2004
Pages :
12
From page :
377
To page :
388
Abstract :
Entropy measures of DNA sequences estimate their randomness or, inversely, their repeatability. L-block Shannon discrete entropy accounts for the empirical distribution of all length-L words and has convergence problems for finite sequences. A new entropy measure that extends Shannon's formalism is proposed. Rényi's quadratic entropy, calculated with the Parzen window density estimation method applied to CGR/USM continuous maps of DNA sequences, constitutes a novel technique to evaluate global sequence randomness without some of the drawbacks of the former method. The asymptotic behaviour of this new measure was analytically deduced, and entropies were calculated for several synthetic and experimental biological sequences. The results obtained were compared with the distributions of the null model of randomness obtained by simulation. The biological sequences showed different p-values according to the kernel resolution of Parzen's method, which might indicate an unknown level of organization of their patterns. This new technique can be very useful in the study of DNA sequence complexity and provides additional tools for DNA entropy estimation. The main MATLAB applications developed and additional material are available at the webpage http://bioinformatics.musc.edu/renyi. Specialized functions can be obtained from the authors.
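The idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' MATLAB code, and it rests on several assumptions that the abstract does not specify: a two-dimensional CGR map on the unit square with the corner assignment shown below, a starting point at the centre of the square, and an isotropic Gaussian Parzen kernel of standard deviation `sigma`. Under the Gaussian assumption, the integral of the squared density estimate reduces to a double sum over pairs of map points, each term being a Gaussian of variance 2·sigma² evaluated at the pair's difference. The function names `cgr_points` and `renyi_quadratic_entropy` are hypothetical.

```python
import math

# Assumed corner assignment on the unit square (one common CGR convention).
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Map a DNA string to CGR coordinates by repeated half-way moves."""
    x, y = 0.5, 0.5  # assumed starting point: centre of the square
    points = []
    for base in seq.upper():
        cx, cy = CORNERS[base]
        # Move half-way from the current point towards the base's corner.
        x, y = (x + cx) / 2.0, (y + cy) / 2.0
        points.append((x, y))
    return points


def renyi_quadratic_entropy(points, sigma):
    """Estimate H2 = -ln(integral of f^2) with a Gaussian Parzen window.

    For Gaussian kernels of variance sigma^2, the integral of f^2 equals the
    average over all point pairs of a 2-D Gaussian with variance 2*sigma^2.
    """
    n = len(points)
    var = 2.0 * sigma ** 2
    norm = 1.0 / (2.0 * math.pi * var)  # 2-D isotropic Gaussian normalizer
    total = 0.0
    for xi, yi in points:
        for xj, yj in points:
            d2 = (xi - xj) ** 2 + (yi - yj) ** 2
            total += norm * math.exp(-d2 / (2.0 * var))
    return -math.log(total / n ** 2)


if __name__ == "__main__":
    repetitive = cgr_points("A" * 48)
    mixed = cgr_points("ACGT" * 12)
    # A single-base repeat collapses towards one corner, so its entropy
    # estimate is lower than that of a sequence visiting all four quadrants.
    print(renyi_quadratic_entropy(repetitive, 0.1))
    print(renyi_quadratic_entropy(mixed, 0.1))
```

The double loop is quadratic in sequence length, which is adequate for short illustrative sequences only. Varying `sigma` corresponds to changing the kernel resolution mentioned in the abstract.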
Keywords :
CGR/USM , Rényi entropy , Information theory , Parzen's method , DNA
Journal title :
Journal of Theoretical Biology
Serial Year :
2004
Record number :
1536706