• DocumentCode
    3388789
  • Title

    Non-Stationary Analysis of DNA Sequences

  • Author

    Bouaynaya, Nidhal ; Schonfeld, Dan

  • Author_Institution
    Department of Systems Engineering, University of Arkansas at Little Rock.
  • fYear
    2007
  • fDate
    26-29 Aug. 2007
  • Firstpage
    200
  • Lastpage
    204
  • Abstract
    Previous searches for long-range correlations in DNA sequences was carried out using statistical tools for stationary signals. However, genomic signals are non-stationary as can be attested by standard statistical tests for stationarity. In this paper, we address, in the light of non-stationary time-series analysis, the questions of (i) the existence of long-range correlations in DNA sequences and (ii) whether they are present in both coding and non-coding segments or only in the latter. It turns out that the statistical differences between coding and non-coding segments are more subtle than previously claimed by the stationary analysis. Both coding and non-coding sequences exhibit long-range correlations, as asserted by an evolutionary 1/f spectrum (i.e., having a time-dependent spectral exponent). Moreover, the average spectral exponent of non-coding segments is higher than its counterpart for coding segments. To prove that this observation is not an artifact of the 1/f evolutionary spectrum, we show, using an index of randomness that we derive from the frequency-time distribution of the genomic signals, that coding sequences are "more random" (i.e., whiter) than non-coding sequences. We believe that this result is likely the source of confusion and controversy in previous work, which relied on stationary analysis of DNA correlations.
  • Keywords
    Alzheimer´s disease; Autocorrelation; Bioinformatics; DNA; Genomics; Sequences; Signal analysis; Testing; Time series analysis; White noise; Hilbert transform; Non-stationary time-series analysis; empirical mode decomposition (EMD); evolutionary periodogram; evolutionary spectrum;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Statistical Signal Processing, 2007. SSP '07. IEEE/SP 14th Workshop on
  • Conference_Location
    Madison, WI, USA
  • Print_ISBN
    978-1-4244-1198-6
  • Electronic_ISBN
    978-1-4244-1198-6
  • Type

    conf

  • DOI
    10.1109/SSP.2007.4301247
  • Filename
    4301247