• DocumentCode
    2786666
  • Title

    Building the Tree of Life on Terascale Systems

  • Author

    Feng, Xizhou ; Cameron, Kirk W. ; Sosa, Carlos P. ; Smith, Brian

  • Author_Institution
    Dept. of Comput. Sci., Virginia Tech., Blacksburg, VA
  • fYear
    2007
  • fDate
    26-30 March 2007
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Bayesian phylogenetic inference is an important alternative to maximum likelihood-based phylogenetic method. However, inferring large trees using the Bayesian approach is computationally demanding - requiring huge amounts of memory and months of computational time. With a combination of novel parallel algorithms and latest system technology, terascale phylogenetic tools provide biologists the computational power necessary to conduct experiments on very large dataset, and thus aid construction of the tree of life. In this work we evaluate the performance of PBPI, a parallel application that reconstructs phylogenetic trees using MCMC-based Bayesian methods, on two terascale systems, Blue Gene/L at IBM Rochester and System X at Virginia Tech. Our results confirm that for a benchmark dataset with 218 taxa and 10000 characters, PBPI can achieve linear speedup on 1024 or more processors for both systems.
  • Keywords
    Bayes methods; biology computing; evolution (biological); inference mechanisms; parallel algorithms; parallel machines; performance evaluation; tree data structures; very large databases; Bayesian phylogenetic inference; Blue Gene/L; System X; maximum likelihood-based phylogenetic method; parallel algorithms; phylogenetic trees; terascale phylogenetic tools; terascale systems; very large dataset; Bayesian methods; Biological information theory; Biology computing; Computer science; Concurrent computing; Evolution (biology); Kirk field collapse effect; Parallel algorithms; Phylogeny; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    1-4244-0910-1
  • Electronic_ISBN
    1-4244-0910-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2007.370214
  • Filename
    4227942