• DocumentCode
    3143126
  • Title

    Computing the Phylogenetic Likelihood Function Out-of-Core

  • Author

    Izquierdo-Carrasco, Fernando ; Stamatakis, Alexandros

  • Author_Institution
    Exelixis Lab., Heidelberg Inst. for Theor. Studies, Heidelberg, Germany
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    444
  • Lastpage
    451
  • Abstract
    The computation of the phylogenetic likelihood function for reconstructing evolutionary trees from molecular sequence data is both memory- and compute-intensive. Based on our experience with the user community of RAxML, memory-shortages (as opposed to CPU time limitations) are currently the prevalent problem regarding resource availability, that is, lack of memory hinders large-scale biological analyses. To this end, we study the performance of an out-of-core execution of the phylogenetic likelihood function by means of a proof-of-concept implementation in RAxML. We find that RAM miss rates are below 10%, even if only 5% of the required data structures are held in RAM. Moreover, we show that our proof-of-concept implementation runs more than 5 times faster than the respective standard implementation when paging is used. The concepts presented here can be applied to all programs that rely on the phylogenetic likelihood function and can contribute significantly to enabling the computation of whole-genome phylogenies.
  • Keywords
    biology computing; genetics; molecular biophysics; random-access storage; tree data structures; RAM miss rates; RAxML; biological analysis; data structure; evolutionary tree reconstruction; memory-shortages; molecular sequence data; out-of-core execution; phylogenetic likelihood function out-of-core; whole-genome phylogenies; Algorithm design and analysis; DNA; Data structures; Memory management; Phylogeny; Random access memory; Vegetation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-425-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.185
  • Filename
    6008864