DocumentCode
3330971
Title
Nearly tight bounds on the learnability of evolution
Author
Ambainis, Andris ; Desper, Richard ; Farach, Martin ; Kannan, Sampath
Author_Institution
Latvian State Univ., Riga, Latvia
fYear
1997
fDate
20-22 Oct 1997
Firstpage
524
Lastpage
533
Abstract
Evolution is often modeled as a stochastic process which modifies DNA. One of the most popular and successful such processes are the Cavender-Farris (CF) trees, which are represented as edge weighted trees. The Phylogeny Construction Problem is that of, given κ samples drawn from a CF tree, output a CF tree which is close to the original. Each CF tree naturally defines a random variable, and the gold standard for reconstructing such trees is the maximum likelihood estimator of this variable. This approach is notoriously computationally expensive. We show that a very simple algorithm, which is a variant on one of the most popular algorithms used by practitioners, converges on the true tree at a rate which differs from the optimum by a constant. We do this by analyzing upper and lower bounds for the convergence rate of learning very simple CF trees, and then show that the learnability of each CF tree is sandwiched between two such simpler trees. Our results rely on the fact that, if the right metric is used, the likelihood space of CF trees is smooth
Keywords
genetic algorithms; learning (artificial intelligence); maximum likelihood estimation; stochastic processes; trees (mathematics); CF tree; Cavender-Farris trees; DNA; Phylogeny Construction Problem; computationally expensive; convergence rate; edge weighted trees; evolution learnability; maximum likelihood estimator; nearly tight bounds; random variable; stochastic process; Computer science; Convergence; DNA; Evolution (biology); Mathematics; Maximum likelihood estimation; Morphology; Phylogeny; Random variables; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Foundations of Computer Science, 1997. Proceedings., 38th Annual Symposium on
Conference_Location
Miami Beach, FL
ISSN
0272-5428
Print_ISBN
0-8186-8197-7
Type
conf
DOI
10.1109/SFCS.1997.646141
Filename
646141
Link To Document