DocumentCode :
3424773
Title :
Iterative weighting of phylogenetic profiles increases classification accuracy
Author :
Craig, Roger ; Liao, Li
Author_Institution :
Dept. of Comput. & Inf. Sci., Delaware Univ., Newark, DE, USA
fYear :
2005
fDate :
15-17 Dec. 2005
Abstract :
Phylogenetic profiles of proteins - strings of ones and zeros encoding the presence and absence of proteins in a group of genomes - have been utilized to predict functionally linked proteins. In this work, we developed a method that incorporates into profile similarity the evolutionary relations that are represented in the phylogenetic tree of the genomes. The method extends the profile to encode the phylogenetic tree as extra bits, with scores reflecting the chances of interior nodes - hypothetical ancestral genomes of developing divergence in the descendants. The scoring scheme is refined with weighting factors that are collected from the training data and are iteratively updated from the predicted results. We tested the method on the proteome of Saccharomyces cerevisias - the budding yeast and used the MIPS classification as the benchmark. With such weighted phylogenetic profiles, the accuracy of our classifier - a support vector machine - was greatly increased.
Keywords :
biology computing; genetics; pattern classification; proteins; support vector machines; trees (mathematics); MIPS classification; Saccharomyces cerevisias; ancestral genomes; budding yeast; iterative phylogenetic profile weighting; phylogenetic protein profiles; phylogenetic tree; profile similarity; proteome; support vector machine; training data; Benchmark testing; Bioinformatics; Encoding; Fungi; Genomics; Phylogeny; Proteins; Support vector machine classification; Support vector machines; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2005. Proceedings. Fourth International Conference on
Print_ISBN :
0-7695-2495-8
Type :
conf
DOI :
10.1109/ICMLA.2005.44
Filename :
1607445
Link To Document :
بازگشت