DocumentCode
2195299
Title
Optimizing MapReduce scheduling using datanode load prediction
Author
Patel, Dharmesh ; Hasan, Mosin ; Sharma, Kirti
Author_Institution
Department of Computer Engineering, Birla Vishwakarma Mahavidyalaya, Vallabh Vidyanagar, India
fYear
2015
fDate
24-25 Jan. 2015
Firstpage
1
Lastpage
4
Abstract
MapReduce [3] is a leading distributed programming model for data-intensive computing. The strength of the MapReduce program depends on splitting the large datasets into the small input blocks and processes them on the cluster of machines. The Optimization of the data-intensive computing model is mainly relying on the Job scheduler [5] component of Hadoop MapReduce framework. The JobTracker assigns the job to the TaskTrackers with the help of job scheduler [14]. The JobTracker does not consider the CPU intensive task running on the TaskTrackers while assigning new task to the TaskTrackers which leads to the node crash or failure. In our proposed Research method, job Tracker schedules the job by considering the load statistics of the TaskTracker into the account. TaskTracker will send current load statistics by modifying heartbeat message. The load statistic information includes the CPU information, physical memory, swap memory, disk IO etc.
Keywords
Computational modeling; Computers; File systems; Heart beat; Processor scheduling; Programming; Tutorials; HDFS; Hadoop; Heartbeat; JobTracker; MapReduce; Task Scheduler;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical, Electronics, Signals, Communication and Optimization (EESCO), 2015 International Conference on
Conference_Location
Visakhapatnam, India
Print_ISBN
978-1-4799-7676-8
Type
conf
DOI
10.1109/EESCO.2015.7253840
Filename
7253840
Link To Document