• DocumentCode
    2991763
  • Title

    MTSD: A Task Scheduling Algorithm for MapReduce Base on Deadline Constraints

  • Author

    Tang, Zhuo ; Zhou, Junqing ; Li, Kenli ; Li, Ruixuan

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    2012
  • Lastpage
    2018
  • Abstract
    The previous works about MapReduce task scheduling with deadline constraints neither take the diffenences of Map and Reduce task, nor the cluster´s heterogeneity into account. This paper proposes an extensional MapReduce Task Scheduling algorithm for Deadline constraints in Hadoop platform: MTSD. It allows user specify a job´s deadline and tries to make the job be finished before the deadline. Through measuring the node´s computing capacity, a node classification algorithm is proposed in MTSD. This algorithm classifies the nodes into several levels in heterogeneous clusters. Under this algorithm, we firstly illuminate a novel data distribution model which distributes data according to the node´s capacity level respectively. The experiments show that the data locality is improved about 57%. Secondly, we calculate the task´s average completion time which is based on the node level. It improves the precision of task´s remaining time evaluation. Finally, MTSD provides a mechanism to decide which job´s task should be scheduled by calculating the Map and Reduce task slot requirements.
  • Keywords
    distributed programming; scheduling; Deadline constraints; Hadoop platform; MTSD; Map task; MapReduce base; data distribution model; data locality; deadline constraints; extensional MapReduce task scheduling algorithm; heterogeneous clusters; node classification algorithm; Classification algorithms; Clustering algorithms; Computational modeling; Data models; Data processing; Scheduling; Scheduling algorithms; Hadoop; MapReduce; data locality; deadline constraints; scheduling algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0974-5
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2012.250
  • Filename
    6270409