• DocumentCode
    1641750
  • Title

    Phylogenetic reconstruction with disk-covering and Bayesian approaches

  • Author

    Guo, Yan ; Ye, Fei ; Tang, Jijun

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of South Carolina, Columbia, SC
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The DCM approach is commonly used to divide the dataset into smaller subproblems, analyze each subproblem using a base method to obtain subtrees, then recombine these subtrees to build the final phylogeny over the whole dataset. In recent years, the new and improved method MrBayes, a Bayesian Markov Chain Monte Carlo (MCMC) approach is widely used for phylogeny analysis. In this paper, a new method for large scale Bayesian phylogeny analysis is proposed. This new method (DCM3-MrBayes) is an improved version of Rec-I-DCM3 (recursive iterative disk-covering method), which uses a divide-and-conquer approach and is designed for large dataset analysis. To integrate MrBayes with Rec-I-DCM3, we have to deal with some unique problems and proposed several methods to tackle these problems. Our improvements include a cache system that can avoid unnecessary computations and a method to eliminate weak branches indicated by the Bayesian analysis to filter out potential bad branches. Our experiments on simulated datasets shows promising improvement over the original DCM. One of the most important advantages of using Bayesian method for phylogeny reconstruction is being able to calculate the posterior probabilities. A divide-and-conquer Bayesian method looses its ability to calculate the posterior probabilities due to the fact that each subproblem generates its own posterior probabilities, which posts some difficulties for obtaining the posterior probability for the whole problem. In order to preserve the advantage of Bayesian approach, we also introduce an algorithm that calculates the posterior probabilities of the whole phylogeny from the subproblemspsila posterior probabilities.
  • Keywords
    Bayes methods; Markov processes; Monte Carlo methods; biology computing; evolution (biological); iterative methods; trees (mathematics); Bayesian Markov Chain Monte Carlo approach; Bayesian approach; MrBayes method; disk covering approach; divide-and-conquer approach; large scale Bayesian phylogeny; phylogenetic reconstruction; phylogeny analysis; posterior probability; recursive iterative disk-covering method; subtrees; Bayesian methods; Evolution (biology); Filters; History; Iterative methods; Large-scale systems; Monte Carlo methods; Phylogeny; Probability; Reconstruction algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4244-2844-1
  • Electronic_ISBN
    978-1-4244-2845-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2008.4696745
  • Filename
    4696745