DocumentCode :
2855711
Title :
Information divergence measures-for detection of borders between coding and noncoding DNA regions using recursive entropic segmentation
Author :
Nicorici, Daniel ; Astola, Jaakko
Author_Institution :
Tampere Int. Center for Signal Process., Tampere Univ. of Technol., Finland
fYear :
2003
fDate :
28 Sept.-1 Oct. 2003
Firstpage :
577
Lastpage :
580
Abstract :
Entropy-based divergence measures have shown promising results in many areas of engineering and image processing. In this study, we use the Jensen-Shannon and Jensen-Renyi divergence measures for recursive segmentation of DNA sequences in order to find borders between coding and noncoding regions. Heterogeneous DNA sequences that are comprised of the four nucleotides A, C, G, and T and the stop codons can be partitioned into homogeneous domains. We introduce a new 18 symbol alphabet that captures: (i) the differential base composition in codons, and (ii) the differential stop codon composition along three phases in both DNA strands. For two entire genomes of bacteria our results obtained using the new approach, based on Jensen-Renyi divergence and the new 18 symbol alphabet, are more accurate than those obtained using the standard approach, based on Jensen-Shannon divergence, when searching for borders between coding and noncoding DNA regions.
Keywords :
DNA; entropy; image coding; image segmentation; medical image processing; Jensen-Renyi divergence; Jensen-Shannon divergence; borders detection; differential stop codon composition; entropic segmentation; entropy-based divergence measures; heterogeneous DNA sequences; image processing; noncoding DNA regions; Area measurement; Bioinformatics; DNA; Entropy; Genomics; Image coding; Image segmentation; Microorganisms; Sequences; Signal processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Statistical Signal Processing, 2003 IEEE Workshop on
Print_ISBN :
0-7803-7997-7
Type :
conf
DOI :
10.1109/SSP.2003.1289538
Filename :
1289538
Link To Document :
بازگشت