• DocumentCode
    228782
  • Title

    FlexSlot: Moving Hadoop Into the Cloud with Flexible Slot Management

  • Author

    Yanfei Guo ; Jia Rao ; Changjun Jiang ; Xiaobo Zhou

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Colorado, Colorado Springs, CO, USA
  • fYear
    2014
  • fDate
    16-21 Nov. 2014
  • Firstpage
    959
  • Lastpage
    969
  • Abstract
    Load imbalance is a major source of overhead in Hadoop where the uneven distribution of input data among tasks can significantly delays the job completion. Running Hadoop in a private cloud opens up opportunities for mitigating data skew with elastic resource allocation, where stragglers are expedited with more resources, yet introduces problems that often cancel out the performance gain: (1) performance interference from co running jobs may create new stragglers, (2) there exist a semantic gap between Hadoop task management and resource pool-based virtual cluster management preventing efficient resource usage. We present FlexSlot, a user-transparent task slot management scheme that automatically identifies map stragglers and resizes their slots accordingly to accelerate task execution. FlexSlot adaptively changes the number of slots on each virtual node to promote efficient usage of resource pool. Experimental results with representative benchmarks show that FlexSlot effectively reduces job completion time by 46% and achieves better resource utilization.
  • Keywords
    cloud computing; data handling; parallel processing; resource allocation; FlexSlot; Hadoop task management; data distribution; data skew; flexible slot management; job completion time; load imbalance; map stragglers; performance interference; private cloud; resource allocation; resource pool-based virtual cluster management; resource usage; resource utilization; semantic gap; task execution; user-transparent task slot management scheme; virtual node; Acceleration; Cloud computing; Dynamic scheduling; Measurement; Memory management; Resource management; Runtime;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
  • Conference_Location
    New Orleans, LA
  • Print_ISBN
    978-1-4799-5499-5
  • Type

    conf

  • DOI
    10.1109/SC.2014.83
  • Filename
    7013065