• DocumentCode
    169846
  • Title

    BIGhybrid -- A Toolkit for Simulating MapReduce in Hybrid Infrastructures

  • Author

    Dos Anjos, Julio C. S. ; Fedak, Gilles ; Geyer, Claudio F. R.

  • Author_Institution
    Fed. Univ. of Rio Grande do Sul, Porto Alegre, Brazil
  • fYear
    2014
  • fDate
    22-24 Oct. 2014
  • Firstpage
    132
  • Lastpage
    137
  • Abstract
    Cloud computing has increasingly been used as a platform for running large business and data processing applications. Although clouds have become extremely popular, when it comes to data processing, their use incurs high costs. Conversely, Desktop Grids, have been used in a wide range of projects, and are able to take advantage of the large number of resources provided by volunteers, free of charge. Merging cloud computing and desktop grids into a hybrid infrastructure can provide a feasible low-cost solution for big data analysis. Although frameworks like MapReduce have been devised to exploit commodity hardware, their use in a hybrid infrastructure raise some challenges due to their large resource heterogeneity and high churn rate. This study introduces BIG hybrid, a toolkit that is used to simulate MapReduce in hybrid environments. Its main goal is to provide a framework for developers and system designers that can enable them to address the issues of Hybrid MapReduce. In this paper, we describe the framework which simulates the assembly of two existing middleware: Bit Dew-MapReduce for Desktop Grids and Hadoop-Blob Seer for Cloud Computing. The experimental results that are included in this work demonstrate the feasibility of our approach.
  • Keywords
    Big Data; cloud computing; middleware; BIGhybrid toolkit; Hadoop-Blob Seer; big data analysis; cloud computing; commodity hardware; data processing applications; desktop grids; high churn rate; hybrid MapReduce; hybrid environments; hybrid infrastructures; middleware; resource heterogeneity; Cloud computing; Computational modeling; Computer architecture; Data models; Data processing; Distributed databases; Data-Intensive Computing; Distributed Systems; Hybrid Infrastructures; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture and High Performance Computing Workshop (SBAC-PADW), 2014 International Symposium on
  • Conference_Location
    Paris
  • Type

    conf

  • DOI
    10.1109/SBAC-PADW.2014.8
  • Filename
    6972028