DocumentCode :
791170
Title :
A Three-State Model for DNA Protein-Coding Regions
Author :
Pinho, A.J. ; Neves, A.J.R. ; Afreixo, V. ; Bastos, C.A.C. ; Ferreira, P.J.S.G.
Author_Institution :
Signal Process. Lab., Aveiro Univ.
Volume :
53
Issue :
11
fYear :
2006
Firstpage :
2148
Lastpage :
2155
Abstract :
It is known that the protein-coding regions of DNA are usually characterized by a three-base periodicity. In this paper, we exploit this property, studying a DNA model based on three deterministic states, where each state implements a finite-context model. The experimental results obtained confirm the appropriateness of the proposed approach, showing compression gains in relation to the single finite-context model counterpart. Additionally, and potentially more interesting than the compression gain on its own, is the observation that the entropy associated to each of the three base positions of a codon differs and that this variation is not the same among the organisms analyzed
Keywords :
DNA; entropy; molecular biophysics; physiological models; proteins; DNA protein-coding regions; codon; compression gain; deterministic states; entropy; finite context model; three-base periodicity; three-state model; Amino acids; Bioinformatics; DNA; Data compression; Entropy; Genetics; Genomics; Proteins; Sequences; Signal processing; DNA compression; DNA sequence modeling; finite-context models; Algorithms; Base Sequence; Computer Simulation; DNA; Models, Genetic; Molecular Sequence Data; Open Reading Frames; Proteins; Sequence Alignment; Sequence Analysis, DNA;
fLanguage :
English
Journal_Title :
Biomedical Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9294
Type :
jour
DOI :
10.1109/TBME.2006.879477
Filename :
1710155
Link To Document :
بازگشت