• DocumentCode
    2906297
  • Title

    Scheduling Mixed Real-Time and Non-real-Time Applications in MapReduce Environment

  • Author

    Dong, Xicheng ; Wang, Ying ; Liao, Huaming

  • Author_Institution
    Inst. Of Comput. Technol., Beijing, China
  • fYear
    2011
  • fDate
    7-9 Dec. 2011
  • Firstpage
    9
  • Lastpage
    16
  • Abstract
    MapReduce scheduling is becoming a hot topic as MapReduce attracts more and more attention from both industry and academia. In this paper, we focus on the scheduling of mixed real-time and non-real-time applications in MapReduce environment, which is a challenging problem but receives only limited attention. To solve this problem, we present a two-level MapReduce scheduler built on previous techniques and make two key contributions. First, to meet the performance goal of real-time applications, we propose a deadline scheduler which adopts (1) a sampling based approach-Tasks Forward Scheduling (TFS) to predict map/reduce task execution time(unlike prior work that requires users to input an estimated value). (2) a resource allocation model-Approximately Uniform Minimum Degree of parallelism (AUMD) to dynamically control each realtime job to execute with minimum tasks assignment in any time so as to maximize the number of concurrent real-time jobs. Second, through integrating this deadline scheduler into existing MapReduce scheduler, we develop a two-level scheduler with resource preemption supported, and it could schedule mixed real-time and non-real-time jobs according to their respective performance demands. We implement our scheduler in Hadoop system and experiments running on a real, small-scale cluster demonstrate that it could schedule mixed real-time and nonreal-time jobs to meet their different quality-of-service (QoS) demands.
  • Keywords
    distributed processing; quality of service; resource allocation; sampling methods; scheduling; Hadoop system; MapReduce scheduling; MapReduce task execution time; application scheduling; approximately uniform minimum degree of parallelism model; deadline scheduler; mixed realtime application; nonrealtime application; quality-of-service demand; resource allocation model; resource preemption support; sampling based approach; tasks forward scheduling; Manganese; Processor scheduling; Quality of service; Real time systems; Resource management; Schedules; Deadline Scheduler; Hadoop; MapReduce; Non-real-time; Real-time; Two-level Scheduler;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International Conference on
  • Conference_Location
    Tainan
  • ISSN
    1521-9097
  • Print_ISBN
    978-1-4577-1875-5
  • Type

    conf

  • DOI
    10.1109/ICPADS.2011.115
  • Filename
    6121254