• DocumentCode
    427652
  • Title
    DFT based DNA splicing algorithms for prediction of protein coding regions
  • Author
    Datta, Suprakash; Asif, Amir
  • Author_Institution
    Dept. of Comput. Sci. Eng., York Univ., Toronto, Ont., Canada
  • Volume
    1
  • fYear
    2004
  • fDate
    7-10 Nov. 2004
  • Firstpage
    45
  • Abstract
    Identifying protein coding regions in DNA sequences is a fundamental step in computational recognition of genes. Traditional discrete Fourier transform (DFT) based approaches exploit the empirical observation that the spectrum of protein coding DNA of length N nucleotides has a peak at frequency k=N/3, corresponding to the length of a DNA codon. In this paper, we prove this observation and several other empirical observations attributed to DNA sequences. Our analytical results lead to faster and more accurate DFT-based algorithms for predicting coding regions. Further, our algorithms require no prior training.
  • Keywords
    biocomputing; discrete Fourier transforms; genetics; medical signal processing; molecular biophysics; proteins; sequences; splicing; DFT; DNA codon sequence; DNA splicing algorithm; computational recognition; discrete Fourier transform; empirical observation; gene; nucleotide; protein coding region identification; Bioinformatics; DNA; Discrete Fourier transforms; Frequency; Genomics; Prediction algorithms; Protein engineering; Sequences; Signal processing algorithms; Splicing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
  • Print_ISBN
    0-7803-8622-1
  • Type
    conf
  • DOI
    10.1109/ACSSC.2004.1399084
  • Filename
    1399084
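
The period-3 observation stated in the abstract can be illustrated with a minimal Python sketch. It implements the traditional indicator-sequence DFT measure (one binary sequence per nucleotide, spectra summed over the four bases), not the authors' faster algorithms, which this record does not describe. The function names, the toy sequences, and the peak-to-mean ratio are assumptions introduced here for illustration only.

```python
import cmath
import random

BASES = "ACGT"


def spectral_content(seq, k):
    """Return S[k] = sum over bases b of |U_b[k]|^2, where U_b is the DFT
    of the binary indicator sequence of base b (1 where seq[n] == b)."""
    n_len = len(seq)
    total = 0.0
    for base in BASES:
        # Direct DFT coefficient at bin k; only positions holding `base` contribute.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * k * n / n_len)
            for n, sym in enumerate(seq)
            if sym == base
        )
        total += abs(coeff) ** 2
    return total


def period3_peak_ratio(seq):
    """Hypothetical score: S[N/3] divided by the mean of S[k] for k = 1..N/2."""
    n_len = len(seq)
    if n_len == 0 or n_len % 3 != 0:
        raise ValueError("sequence length must be a nonzero multiple of 3")
    spectrum = [spectral_content(seq, k) for k in range(1, n_len // 2 + 1)]
    mean = sum(spectrum) / len(spectrum)
    return spectral_content(seq, n_len // 3) / mean


if __name__ == "__main__":
    # Toy input with exact period 3; real coding DNA shows only a statistical bias.
    periodic = "ATG" * 30
    # Uniformly random bases as a stand-in for sequence without codon structure.
    rng = random.Random(0)
    background = "".join(rng.choice(BASES) for _ in range(90))
    print("periodic  :", period3_peak_ratio(periodic))
    print("background:", period3_peak_ratio(background))
```

The direct sum costs O(N) per frequency bin, which is enough when only the k=N/3 bin is needed; a sliding-window scan over a long sequence would evaluate `spectral_content(window, len(window) // 3)` per window and threshold the result.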