DocumentCode
1633622
Title
Detection of 3-periodicity for small genomic sequences based on AR technique
Author
Rao, Nini ; Shepherd, Simon J.
Author_Institution
Sch. of Life Sci. & Technol., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Volume
2
fYear
2004
Firstpage
1032
Abstract
The major signal in protein coding regions of genomic sequences is f=1/3 periodicity; some methods such as FFT-based methods, autocorrelation, mutual information function, etc., which exploit this phenomenon, rapidly lose effectiveness in the case of small DNA sequences when attempting to detect 3-periodicity. The paper proposes an AR technique as an alternative tool for this purpose, due to its improved coding region resolution for small data records. Theoretical analysis and experimental results show that the detection resolution for the AR technique is higher than that of Fourier methods for small DNA sequences. The sequence length and structure are the main factors that affect performance of any detection method. However, AR methods are more robust against variations in these factors. Unlike neural net-based methods, no a priori knowledge of the sequences is required. Hence, the AR technique appears to be a useful tool for 3-periodicity (including other periodicities), repeat and regulatory regions of unknown genomic sequences, especially small genomic sequence.
Keywords
DNA; autoregressive processes; maximum entropy methods; proteins; sequences; spectral analysis; 3-periodicity detection; AR technique; DNA sequence length; DNA sequence structure; FFT; Fourier methods; autocorrelation; coding region resolution; detection resolution; genomic sequences; maximum entropy spectral analysis; mutual information function; neural net-based methods; protein coding sequences; Autocorrelation; Bioinformatics; DNA computing; Frequency; Genomics; Mutual information; Protein engineering; Robustness; Sequences; Spectral analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications, Circuits and Systems, 2004. ICCCAS 2004. 2004 International Conference on
Print_ISBN
0-7803-8647-7
Type
conf
DOI
10.1109/ICCCAS.2004.1346354
Filename
1346354
Link To Document