• DocumentCode
    1915908
  • Title

    Supporting Bulk Synchronous Parallelism in Map-Reduce Queries

  • Author

    Fegaras, Leonidas

  • Author_Institution
    CSE, Univ. of Texas at Arlington, Arlington, TX, USA
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1068
  • Lastpage
    1077
  • Abstract
    One of the major drawbacks of the Map-Reduce (MR) model is that, to simplify reliability and fault tolerance, it does not preserve data in memory across consecutive MR jobs: a MR job must dump its data to the distributed file system before they can be read by the next MR job. This restriction imposes a high overhead to complex MR workflows and graph algorithms, such as PageRank, which require repetitive MR jobs. The Bulk Synchronous Parallelism (BSP) programming model, on the other hand, has been recently advocated as an alternative to the MR model that does not suffer from this restriction, and, under certain circumstances, allows complex repetitive algorithms to run entirely in the collective memory of a cluster. We present a framework for translating complex declarative queries for scientific and graph data analysis applications to both MR and BSP evaluation plans, leaving the choice to be made at run-time based on the available resources. If the resources are sufficient, the query will be evaluated entirely in memory based on the BSP model, otherwise, the same query will be evaluated based on the MR model.
  • Keywords
    parallel programming; query processing; BSP programming model; MR job; MR workflow; Map-Reduce query; PageRank graph algorithms; bulk synchronous parallelism support; declarative query translation; distributed file system; repetitive algorithm; bulk synchronous parallelism; cloud computing; map-reduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-4673-6218-4
  • Type

    conf

  • DOI
    10.1109/SC.Companion.2012.129
  • Filename
    6495911