DocumentCode :
3668385
Title :
HPCA: A node selection and scheduling method for Hadoop MapReduce
Author :
Archana G.K.;V.Deeban Chakravarthy
Author_Institution :
Department Of Computer Science and Engineering, SRM University, Chennai, India
fYear :
2015
Firstpage :
368
Lastpage :
372
Abstract :
Big data is the technology which is designed to handle both structured and unstructured data which has high intensity. Hadoop and MapReduce are two important aspects of big data. Task assignment in MapReduce is done through scheduling algorithms. Scheduling algorithms assign the tasks to a selected data node. Selection of a healthy and available data node to perform the Map and reduce is done based on the availability and the location of the data on which the processing should be done. Creating an algorithm for the node selection is essential to discipline and optimize and improve the performance of the MapReduce. The proposed Health, Priority, Capacity and Availability based Node selection algorithm [HPCA based Node Selection Algorithm] creates a queue of the nodes that are available for accepting the new tasks through scheduling algorithms. This algorithm optimizes the node selection task and provides better performance. It also introduces a failover mechanism to handle the tasks that fail during the execution.
Keywords :
"Scheduling algorithms","Big data","Clustering algorithms","Computer science","Computer architecture"
Publisher :
ieee
Conference_Titel :
Computing and Communications Technologies (ICCCT), 2015 International Conference on
Type :
conf
DOI :
10.1109/ICCCT2.2015.7292777
Filename :
7292777
Link To Document :
بازگشت