DocumentCode :
2982655
Title :
A Stochastic Model for DNA Sequences Using Prescribed Nucleotide and Length Distributions
Author :
Bergen, Stuart W A ; Antoniou, Andreas
Author_Institution :
Dept. of Electr. & Comput. Eng., Victoria Univ., BC
fYear :
2006
fDate :
Aug. 2006
Firstpage :
95
Lastpage :
100
Abstract :
A stochastic model that generates artificial DNA sequences with correlation characteristics similar to those observed in real DNA sequences is proposed. A Bernoulli-like process is used to generate patches of DNA with nucleotide content representative of coding and noncoding region. Alternating coding and noncoding DNA patches are concatenated to form the sequence where the patch length is based on sample statistics. Examples demonstrate that the nonuniform use of codons in coding regions is responsible for the often-observed period-three property. The amplitude of the correlation corresponding to the period-three property is proportional to the coding-region length and inversely proportional to the noncoding-region length. The correlation characteristics of the complete M.tuberculosis, B.subtilis, and S.cerevisiae (chromosome XI) genomes exhibit two distinct branches corresponding to period three and nonperiod-three correlations like those observed for the artificial DNA sequences
Keywords :
DNA; genetics; stochastic processes; B.subtilis genome; Bernoulli-like process; DNA patches; M.tuberculosis genome; S.cerevisiae genome; artificial DNA sequences; chromosome XI genome; correlation characteristics; length distribution; noncoding region; nucleotide content representative; nucleotide distribution; period-three property; sample statistics; stochastic model; Autocorrelation; Bioinformatics; Biological cells; Character generation; DNA; Genomics; Information technology; Sequences; Signal processing; Stochastic processes; DNA modeling; Genomic DSP; period-three property;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology, 2006 IEEE International Symposium on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9753-3
Electronic_ISBN :
0-7803-9754-1
Type :
conf
DOI :
10.1109/ISSPIT.2006.270777
Filename :
4042219
Link To Document :
بازگشت