Title :
Real-Time Scheduling in MapReduce Clusters
Author :
Chen He ; Ying Lu ; Swanson, David
Author_Institution :
Comput. Sci. & Eng. Dept., Univ. of Nebraska-Lincoln, Lincoln, NE, USA
Abstract :
MapReduce has been widely used as a Big Data processing platform. As it gets popular, its scheduling becomes increasingly important. In particular, since many MapReduce applications require real-time data processing, scheduling real time applications in MapReduce environments has become a significant problem. In this paper, we create a novel real-time scheduler for MapReduce, which overcomes the deficiencies of an existing scheduler. It avoids accepting jobs that will lead to deadline misses and improves the cluster utilization. We implement our scheduler in Hadoop system and experimental results show that our scheduler provides deadline guarantees for accepted jobs and achieves good cluster utilization.
Keywords :
Big Data; distributed programming; pattern clustering; scheduling; Big Data processing platform; Hadoop system; MapReduce clusters; cluster utilization; real-time data processing; real-time scheduling; Adaptive control; Clustering algorithms; Estimation; Heart beat; Processor scheduling; Real-time systems; Vectors; MapReduce; cluster utilization; real-time scheduling;
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
DOI :
10.1109/HPCC.and.EUC.2013.216