• DocumentCode
    2806265
  • Title

    A Framework for Managing MapReduce Applications in Dynamic Distributed Environments

  • Author

    Marozzo, Fabrizio ; Talia, Domenico ; Trunfio, Paolo

  • Author_Institution
    DEIS, Univ. of Calabria, Rende, Italy
  • fYear
    2011
  • fDate
    9-11 Feb. 2011
  • Firstpage
    149
  • Lastpage
    158
  • Abstract
    MapReduce is a programming model widely used in data centers for processing large data sets in a highly parallel way. Current MapReduce systems are based on master-slave architectures that do not cope well with dynamic node participation, since they are mostly designed for conventional parallel computing platforms. On the contrary, in Internet-based computing environments, node churn and failures - including master failures - are likely to happen since nodes join and leave the network at an unpredictable rate. The goal of this work is enabling the use of MapReduce in dynamic distributed environments so as to combine the effectiveness of a well-established programming model with the scalability of a large-scale computing infrastructure. This paper presents an adaptive MapReduce framework, called P2P-MapReduce, which exploits a peer-to-peer model to manage intermittent node participation, master failures and job recovery in a decentralized but effective way, so as to provide a more robust MapReduce middleware that can be effectively exploited in Internet-scale dynamic distributed environments.
  • Keywords
    Internet; middleware; peer-to-peer computing; Internet-based computing environments; MapReduce application management framework; P2P-MapReduce middleware; data centers; dynamic distributed environments; job recovery; large-scale computing infrastructure; master failures; master-slave architectures; parallel computing platforms; peer-to-peer model; programming model; Computational modeling; Computer architecture; Data models; Indexes; Maintenance engineering; Peer to peer computing; Programming; Internet computing; MapReduce; Peer-to-peer computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2011 19th Euromicro International Conference on
  • Conference_Location
    Ayia Napa
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4244-9682-2
  • Type

    conf

  • DOI
    10.1109/PDP.2011.41
  • Filename
    5738982