• DocumentCode
    772598
  • Title

    Latent Periodicities in Genome Sequences

  • Author

    Arora, Raman ; Sethares, William A. ; Bucklew, James A.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI
  • Volume
    2
  • Issue
    3
  • fYear
    2008
  • fDate
    6/1/2008 12:00:00 AM
  • Firstpage
    332
  • Lastpage
    342
  • Abstract
    A novel approach is presented for the detection of periodicities in DNA sequences. A DNA sequence can be modelled as a nonstationary stochastic process that exhibits various statistical periodicities over different regions. The coding part of the DNA, for instance, exhibits statistical periodicity with period three. Such regions in DNA are modelled as generated from a collection of information sources (with an underlying probability distribution) in a cyclic manner, thus exhibiting cyclostationarity. The maximum likelihood estimates are developed for the distributions of the information sources and for the statistical period of the DNA sequence. Such probabilistic sources are further investigated for decomposition into constituent cyclostationary sources. Since the symbolic sources do not admit an algebraic structure, a composition of cyclostationary probabilistic sources is studied that models DNA replication process. This composition is shown to give a rich mathematical structure on the collection of cyclostationary sources and allows a uniqueness theorem for the decomposition.
  • Keywords
    DNA; genetics; maximum likelihood estimation; molecular biophysics; sequences; statistical distributions; stochastic processes; DNA coding; DNA sequence; algebraic structure; cyclostationary probabilistic source; genome sequence; maximum likelihood estimation; nonstationary stochastic process; statistical latent periodicity; Bioinformatics; DNA; Fourier transforms; Genetic mutations; Genomics; Probability distribution; Proteins; Sequences; Signal processing; Stochastic processes; Cyclostationarity; gene replication; genomic signal processing; symbolic periodicity; symbolic sequences;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Signal Processing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1932-4553
  • Type

    jour

  • DOI
    10.1109/JSTSP.2008.923861
  • Filename
    4550546