• DocumentCode
    3146870
  • Title

    Computing the Tree of Life: Leveraging the Power of Desktop and Service Grids

  • Author

    Bazinet, Adam L. ; Cummings, Michael P.

  • Author_Institution
    Center for Bioinf. & Comput. Biol., Univ. of Maryland, College Park, MD, USA
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    1896
  • Lastpage
    1902
  • Abstract
    The trend in life sciences research, particularly in molecular evolutionary systematics, is toward larger data sets and ever-more detailed evolutionary models, which can generate substantial computational loads. Over the past several years we have developed a grid computing system aimed at providing researchers the computational power needed to complete such analyses in a timely manner. Our grid system, known as The Lattice Project, was the first to combine two models of grid computing - the service model, which mainly federates large institutional HPC resources, and the desktop model, which harnesses the power of PCs volunteered by the general public. Recently we have developed a "science portal" style web interface that makes it easier than ever for phylogenetic analyses to be completed using GARLI, a popular program that uses a maximum likelihood method to infer the evolutionary history of organisms on the basis of genetic sequence data. This paper describes our approach to scheduling thousands of GARLI jobs with diverse requirements to heterogeneous grid resources, which include volunteer computers running BOINC software. A key component of this system provides a priori GARLI runtime estimates using machine learning with random forests.
  • Keywords
    Internet; data handling; evolutionary computation; grid computing; information services; learning (artificial intelligence); maximum likelihood estimation; portals; trees (mathematics); user interfaces; BOINC software; GARLI jobs; HPC resource; Web interface; computational power; data sets; evolutionary history; evolutionary model; genetic sequence data; grid computing system; heterogeneous grid resource; lattice project; life tree computing; machine learning; maximum likelihood method; molecular evolutionary systematics; phylogenetic analysis; science portal; service grids; service model; substantial computational load; Computational modeling; Grid computing; Lattices; Phylogeny; Runtime; Software; Vegetation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-425-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.344
  • Filename
    6009062