• DocumentCode
    243732
  • Title

    Exploiting Hadoop Topology in Virtualized Environments

  • Author

    de Fatima Pereira, Rosangela ; Akio Goya, Walter ; Langona, Karen ; Mimura Gonzalez, Nelson ; Melo de Brito Carvalho, Tereza Cristina ; Mangs, Jan-Erik ; Sefidcon, Azimeh

  • Author_Institution
    Lab. of Comput. Networks & Archit., Univ. of Sao Paulo, Sao Paulo, Brazil
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    301
  • Lastpage
    308
  • Abstract
    Virtualization is a key technique to make an environment easier to manage in terms of resource allocation. MapReduce is a programming model that provides an abstraction to perform distributed computation for large datasets. Hadoop is a well-known framework that offers an open source implementation for this model. Combining Hadoop and virtualization techniques in cloud-computing environments can unveil great potential, especially for big data context. However, running MapReduce jobs on virtual machines has indicated performance issues not solved yet. In this paper we present and discuss three scenarios regarding Hadoop topology in a cloud infrastructure. The first scenario proposes to allocate Hadoop daemons in a fully virtualized environment, the second scenario presents a hybrid environment, and the third scenario suggests to virtualize only MapReduce daemons.We also report results from a series of tests allocating Hadoop daemons in a fully virtualized environment. Results show that adding virtual machines to the cluster causes an overhead, decreases the efficiency of CPU utilization, and shortens the time slots for the MapReduce jobs.
  • Keywords
    cloud computing; data analysis; virtual machines; virtualisation; Hadoop daemons; Hadoop topology; MapReduce daemons; cloud-computing; virtual machines; virtualization; virtualized environments; Cloud computing; Hardware; Resource management; Virtual machine monitors; Virtual machining; Virtualization; MapReduce; cloud computing; virtualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Services (SERVICES), 2014 IEEE World Congress on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5068-3
  • Type

    conf

  • DOI
    10.1109/SERVICES.2014.60
  • Filename
    6903282