DocumentCode :
1633622
Title :
Detection of 3-periodicity for small genomic sequences based on AR technique
Author :
Rao, Nini ; Shepherd, Simon J.
Author_Institution :
Sch. of Life Sci. & Technol., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Volume :
2
fYear :
2004
Firstpage :
1032
Abstract :
The major signal in protein coding regions of genomic sequences is f=1/3 periodicity; some methods such as FFT-based methods, autocorrelation, mutual information function, etc., which exploit this phenomenon, rapidly lose effectiveness in the case of small DNA sequences when attempting to detect 3-periodicity. The paper proposes an AR technique as an alternative tool for this purpose, due to its improved coding region resolution for small data records. Theoretical analysis and experimental results show that the detection resolution for the AR technique is higher than that of Fourier methods for small DNA sequences. The sequence length and structure are the main factors that affect performance of any detection method. However, AR methods are more robust against variations in these factors. Unlike neural net-based methods, no a priori knowledge of the sequences is required. Hence, the AR technique appears to be a useful tool for 3-periodicity (including other periodicities), repeat and regulatory regions of unknown genomic sequences, especially small genomic sequence.
Keywords :
DNA; autoregressive processes; maximum entropy methods; proteins; sequences; spectral analysis; 3-periodicity detection; AR technique; DNA sequence length; DNA sequence structure; FFT; Fourier methods; autocorrelation; coding region resolution; detection resolution; genomic sequences; maximum entropy spectral analysis; mutual information function; neural net-based methods; protein coding sequences; Autocorrelation; Bioinformatics; DNA computing; Frequency; Genomics; Mutual information; Protein engineering; Robustness; Sequences; Spectral analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Circuits and Systems, 2004. ICCCAS 2004. 2004 International Conference on
Print_ISBN :
0-7803-8647-7
Type :
conf
DOI :
10.1109/ICCCAS.2004.1346354
Filename :
1346354
Link To Document :
بازگشت