• DocumentCode
    3065458
  • Title

    Job Aware Scheduling Algorithm for MapReduce Framework

  • Author

    Nanduri, Radheshyam ; Maheshwari, Nitesh ; Reddyraja, A. ; Varma, Vasudeva

  • Author_Institution
    Search & Inf. Extraction Lab. (SIEL), HIT, Hyderabad, India
  • fYear
    2011
  • fDate
    Nov. 29 2011-Dec. 1 2011
  • Firstpage
    724
  • Lastpage
    729
  • Abstract
    MapReduce framework has received a wide acclaim over the past few years for large scale computing. It has become a standard paradigm for batch oriented workloads. As the adoption of this paradigm has increased rapidly, scheduling of these MapReduce jobs has become a problem of great interest in research community. We propose an approach which tries to maintain harmony among the jobs running on the cluster, and in turn decrease their runtime. In our model, the scheduler is made aware of different types of jobs running on the cluster. The scheduler tries to allocate a task on a node if the incoming task does not affect the tasks already running on that node. From the list of available pending tasks, our algorithm selects the one that is most compatible with the tasks already running on that node. We bring up heuristic and machine learning based solutions to our approach and try to maintain a resource balance on the cluster by not overloading any of the nodes, thereby reducing the overall runtime of the jobs. The results show a saving of runtime of around 21% in the case of heuristic based approach and around 27% in the case of machine learning based approach when compared to Yahoo´s Capacity scheduler.
  • Keywords
    learning (artificial intelligence); scheduling; MapReduce framework; Yahoo capacity scheduler; batch oriented workloads; job aware scheduling algorithm; large scale computing; machine learning based solutions; Availability; Clustering algorithms; Heuristic algorithms; Machine learning; Machine learning algorithms; Runtime; Vectors; Cloud Computing; Job Scheduling; MachineLearning; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4673-0090-2
  • Type

    conf

  • DOI
    10.1109/CloudCom.2011.112
  • Filename
    6133221