• DocumentCode
    105326
  • Title

    NO2: Speeding up Parallel Processing of Massive Compute-Intensive Tasks

  • Author

    Yongwei Wu ; Weichao Guo ; Jinglei Ren ; Xun Zhao ; Weimin Zheng

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • Volume
    63
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    2487
  • Lastpage
    2499
  • Abstract
    Large-scale computing frameworks, either tenanted on the cloud or deployed in the high-end local cluster, have become an indispensable software infrastructure to support numerous enterprise and scientific applications. Tasks executed on these frameworks are generally classified into data-intensive and compute-intensive ones. However, most existing frameworks, led by MapReduce, are mainly suitable for data-intensive tasks. Their task schedulers assume that the proportion of data I/O reflects the task progress and state. Unfortunately, this assumption does not apply to most compute-intensive tasks. Due to biased estimation of task progress, traditional frameworks cannot timely cut off outliers and therefore largely prolong execution time when performing compute-intensive tasks. We propose a new framework designed for compute-intensive tasks. By using instrumentation and automatic instrument point selector, our framework estimates the compute-intensive task progress without resorting to data I/O. We employ a clustering method to identify outliers at runtime and perform speculative execution/aborting, speeding up task execution by up to 25%. Moreover, our improvement to bare instrumentation limits overhead within 0.1%, and the aborting-based execution only introduces 10% more average CPU usage. Low overhead and resource consumption make our framework practically usable in the production environment.
  • Keywords
    parallel processing; scheduling; CPU usage; MapReduce; aborting-based execution; automatic instrument point selector; biased estimation; compute-intensive task progress; data-intensive tasks; high-end local cluster; indispensable software infrastructure; large-scale computing frameworks; massive compute-intensive tasks; parallel processing; production environment; task execution; task schedulers; Binary codes; Instruments; Java; Processor scheduling; Runtime; Servers; Distributed system; compute-intensive; parallel processing; resource management;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2013.132
  • Filename
    6532279