Title :
More powerful discriminants for classifying phylogenetic signals in dinucleotide frequencies
Author :
Baran, Robert H. ; Jeon, Changwon ; Han, David K. ; Ko, Hanseok
Author_Institution :
Dept. of Electron. & Comput. Eng., Korea Univ., Seoul
fDate :
March 31 2008-April 4 2008
Abstract :
Microbial DNA fragments are classified according to species using compositional features and "genomic signatures" the oldest of which is the dinucleotide relative abundance profile defined by Karlin et al. More informative features, including higher order signatures, have demonstrated greater species-specificity in comparison to the baseline established by the dinucleotide signature using "delta-distance" to assess dissimilarity; but lack of standard methods has precluded rigorous comparison. We describe a new method for classifier evaluation that reduces any number of pair-wise inter-genomic comparisons to a single performance measure. To illustrate the method, we compare delta-distance to quadratic and linear discriminants prescribed by elementary pattern recognition theory, and find that the quadratic form is significantly more powerful.
Keywords :
DNA; genetics; medical signal processing; pattern recognition; signal classification; biomedical signal processing; compositional features; delta-distance; dinucleotide frequencies; dinucleotide relative abundance profile; dinucleotide signature; elementary pattern recognition theory; genomic signatures; higher order signatures; microbial DNA fragment classification; pair-wise inter-genomic comparisons; phylogenetic signal classification; quadratic form; Bioinformatics; Biomedical measurements; DNA; Detectors; Frequency; Genomics; Pattern classification; Phylogeny; Power engineering computing; Sequences; Biomedical signal processing; DNA; Error analysis; Pattern classification; Software performance;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517682