• Title of article

    Rényi continuous entropy of DNA sequences

  • Author/Authors

    D. Vinga Szabo ، نويسنده , , Susana and Almeida، نويسنده , , Jonas S.، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2004
  • Pages
    12
  • From page
    377
  • To page
    388
  • Abstract
    Entropy measures of DNA sequences estimate their randomness or, inversely, their repeatability. L -block Shannon discrete entropy accounts for the empirical distribution of all length- L words and has convergence problems for finite sequences. A new entropy measure that extends Shannonʹs formalism is proposed. Rényiʹs quadratic entropy calculated with Parzen window density estimation method applied to CGR/USM continuous maps of DNA sequences constitute a novel technique to evaluate sequence global randomness without some of the former method drawbacks. The asymptotic behaviour of this new measure was analytically deduced and the calculation of entropies for several synthetic and experimental biological sequences was performed. The results obtained were compared with the distributions of the null model of randomness obtained by simulation. The biological sequences have shown a different p -value according to the kernel resolution of Parzenʹs method, which might indicate an unknown level of organization of their patterns. This new technique can be very useful in the study of DNA sequence complexity and provide additional tools for DNA entropy estimation. The main MATLAB applications developed and additional material are available at the webpage http://bioinformatics.musc.edu/renyi. Specialized functions can be obtained from the authors.
  • Keywords
    CGR/USM , Rényi entropy , Information theory , Parzenיs method , DNA
  • Journal title
    Journal of Theoretical Biology
  • Serial Year
    2004
  • Journal title
    Journal of Theoretical Biology
  • Record number

    1536706