DocumentCode
1762202
Title
Data Requirement for Phylogenetic Inference from Multiple Loci: A New Distance Method
Author
Dasarathy, Gautam ; Nowak, Robert ; Roch, Sebastien
Author_Institution
Wisconsin Inst. for Discovery, Madison, WI, USA
Volume
12
Issue
2
fYear
2015
fDate
March-April 2015
Firstpage
422
Lastpage
432
Abstract
We consider the problem of estimating the evolutionary history of a set of species (phylogeny or species tree) from several genes. It is known that the evolutionary history of individual genes (gene trees) might be topologically distinct from each other and from the underlying species tree, possibly confounding phylogenetic analysis. A further complication in practice is that one has to estimate gene trees from molecular sequences of finite length. We provide the first full data-requirement analysis of a species tree reconstruction method that takes into account estimation errors at the gene level. Under that criterion, we also devise a novel reconstruction algorithm that provably improves over all previous methods in a regime of interest.
Keywords
bioinformatics; data analysis; evolution (biological); genetic algorithms; genetics; molecular biophysics; molecular configurations; trees (mathematics); data requirement; distance method; estimation errors; evolutionary history estimation; finite length molecular sequences; full data-requirement analysis; gene trees; multiple loci; phylogenetic inference; regime-of-interest; species tree reconstruction algorithm; Bioinformatics; History; Phylogeny; Reconstruction algorithms; Sociology; Statistics; Vegetation; Phylogenetic inference; distance methods; incomplete lineage sorting; molecular clock; multispecies coalescent; sample complexity
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
IEEE
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2014.2361685
Filename
6917051