DocumentCode :
1595767
Title :
Sequence Length Requirement of Distance-Based Phylogeny Reconstruction: Breaking the Polynomial Barrier
Author :
Roch, Sébastien
Author_Institution :
Microsoft Res. Cambridge, Cambridge, MA
fYear :
2008
Firstpage :
729
Lastpage :
738
Abstract :
We introduce a new distance-based phylogeny reconstruction technique which provably achieves, at sufficiently short branch lengths, a sequence length requirement growing slower than any polynomial. The technique is based on a new averaging procedure that implicitly reconstructs ancestral sequences.In the same token, we extend previous results on phase transitions in phylogeny reconstruction to general time-reversible models. More precisely, we show that in the so-called Kesten-Stigum zone---roughly, a region of the parameter space where ancestral sequences are well approximated by ``linear combinations\´\´ of observed sequences---sequences of length eradiclog n suffice for reconstruction. Here n is the number of extant species. We improve this result to poly(log n) the ultrametric case. Surprisingly, this last result suggests that a UPGMA-type algorithm may in some sense be "optimal\´\´ under a molecular clock. Our results challenge---to some extent---the conventional wisdom that estimates of evolutionary distances alone carry significantly less information about phylogenies than full sequence datasets.
Keywords :
biology computing; polynomials; Kesten-Stigum zone; UPGMA-type algorithm; ancestral sequences; distance-based phylogeny reconstruction; evolutionary distances; polynomial barrier; sequence length requirement; short branch lengths; Computer science; Convergence; DNA; Linear approximation; Maximum likelihood estimation; Phylogeny; Physics; Polynomials; Probability; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Foundations of Computer Science, 2008. FOCS '08. IEEE 49th Annual IEEE Symposium on
Conference_Location :
Philadelphia, PA
ISSN :
0272-5428
Print_ISBN :
978-0-7695-3436-7
Type :
conf
DOI :
10.1109/FOCS.2008.77
Filename :
4691005
Link To Document :
بازگشت