Title :
Job Aware Scheduling Algorithm for MapReduce Framework
Author :
Nanduri, Radheshyam ; Maheshwari, Nitesh ; Reddyraja, A. ; Varma, Vasudeva
Author_Institution :
Search & Inf. Extraction Lab. (SIEL), HIT, Hyderabad, India
fDate :
Nov. 29 2011-Dec. 1 2011
Abstract :
MapReduce framework has received a wide acclaim over the past few years for large scale computing. It has become a standard paradigm for batch oriented workloads. As the adoption of this paradigm has increased rapidly, scheduling of these MapReduce jobs has become a problem of great interest in research community. We propose an approach which tries to maintain harmony among the jobs running on the cluster, and in turn decrease their runtime. In our model, the scheduler is made aware of different types of jobs running on the cluster. The scheduler tries to allocate a task on a node if the incoming task does not affect the tasks already running on that node. From the list of available pending tasks, our algorithm selects the one that is most compatible with the tasks already running on that node. We bring up heuristic and machine learning based solutions to our approach and try to maintain a resource balance on the cluster by not overloading any of the nodes, thereby reducing the overall runtime of the jobs. The results show a saving of runtime of around 21% in the case of heuristic based approach and around 27% in the case of machine learning based approach when compared to Yahoo´s Capacity scheduler.
Keywords :
learning (artificial intelligence); scheduling; MapReduce framework; Yahoo capacity scheduler; batch oriented workloads; job aware scheduling algorithm; large scale computing; machine learning based solutions; Availability; Clustering algorithms; Heuristic algorithms; Machine learning; Machine learning algorithms; Runtime; Vectors; Cloud Computing; Job Scheduling; MachineLearning; MapReduce;
Conference_Titel :
Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4673-0090-2
DOI :
10.1109/CloudCom.2011.112