DocumentCode :
1791760
Title :
A contention aware hybrid evaluator for schedulers of big data applications in computer clusters
Author :
Bardhan, Shouvik ; Menasce, Daniel A.
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
11
Lastpage :
19
Abstract :
Large enterprises use clusters of computers to process Big Data workloads that are heterogeneous in terms of the type of jobs and the nature of their arrival processes. The scheduling of jobs from such workloads has a significant impact on their execution times. This paper presents a Trace Driven Analytic Model (TDAM) methodology to assess the impact of different scheduling schemes on job execution times. The analytic models used by this method consist of closed queuing network methods that estimate congestion at the various nodes of the cluster. The paper demonstrates the usefulness of this approach by showing how four different types of common schedulers affect the execution times of jobs derived from well-known benchmarks. This method is then implemented inside of a popular Hadoop job-trace simulator called Mumak, making Mumak contention-aware. The original Mumak tool completely ignores contention for processors and I/O at each node of the cluster. Our contentiion-aware Mumak predicts job completion times at a significantly higher level of accuracy.
Keywords :
Big Data; business data processing; digital simulation; distributed processing; network theory (graphs); queueing theory; scheduling; Hadoop job-trace simulator; TDAM; big data applications schedulers; big data workloads; closed queuing network methods; computer clusters; contentiion-aware Mumak; contention aware hybrid evaluator; enterprises; job completion times; job execution times; scheduling schemes; trace driven analytic model methodology; Analytical models; Big data; Mathematical model; Queueing analysis; Scheduling; Servers; Time factors; Hadoop; Mean Value Analysis; Mumak; analytic models; experimentation; queuing theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004439
Filename :
7004439
Link To Document :
بازگشت