• DocumentCode
    167617
  • Title

    SkewControl: Gini Out of the Bottle

  • Author

    Si Zheng ; Yunhuai Liu ; Tian He ; Li Shanshan ; Xiangke Liao

  • fYear
    2014
  • fDate
    19-23 May 2014
  • Firstpage
    1572
  • Lastpage
    1580
  • Abstract
    In the age of big data, MapReduce plays an important role in the extreme-scale data processing system. Among all the hot issues, the data skew weights heavily for the MapReduce system performance. In traditional approaches, researchers attempt to leave the users to address the issue which requires the user to possess the application-dependent domain knowledge. Other approaches address the issue automatically but in an open-loop manner which lacks of sufficient adaptivity for different applications. To well address these issues, we conduct trace-driven empirical studies and show that the skew has strong stable and predictable characteristics, which allows us to design a closed-loop automatic mechanism for task partitioning and scheduling, called SkewControl. We implement SkewControl on top of a Hadoop 1.0.4 production system. The experimental results show that compared with the state-of-art LATE and SkewTune systems, SkewControl can consistently improve the system response time by 23.8% and 17% respectively.
  • Keywords
    Big Data; scheduling; Big Data; Hadoop 1.0.4 production system; LATE systems; MapReduce system performance; SkewControl; SkewTune systems; application-dependent domain knowledge; closed-loop automatic mechanism; data skew; extreme-scale data processing system; scheduling; system response time; task partitioning; Cellular phones; Data processing; Distributed databases; Educational institutions; History; Manuals; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
  • Conference_Location
    Phoenix, AZ
  • Print_ISBN
    978-1-4799-4117-9
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2014.176
  • Filename
    6969563