DocumentCode :
683519
Title :
Bandwidth-aware data placement scheme for Hadoop
Author :
Shabeera, T.P. ; Madhu Kumar, S.D.
Author_Institution :
Nat. Inst. of Technol. Calicut, Calicut, India
fYear :
2013
fDate :
19-21 Dec. 2013
Firstpage :
64
Lastpage :
67
Abstract :
We are living in a data rich era. The size of the data is increasing exponentially. Social networking applications, Scientific experiments, etc. are the major contributors of Big Data. The data can be structured, semi-structured or unstructured. Big Data management solutions can be implemented in-house in the organization or it can be stored in cloud. Whether it is stored in-house or in cloud, the placement of data is very important. In general, users demand the availability of data whenever they request for it. There are many parameters that effect the data retrieval time in Hadoop. Among them, this paper pays attention to the available bandwidth. To minimize the data retrieval time, the data must be placed in a DataNode which has the maximum bandwidth. We have proposed a solution for bandwidth-aware data placement in Hadoop by periodically measuring the bandwidth between clients and DataNodes and placing the data blocks in DataNodes that have maximum end-to-end bandwidth.
Keywords :
Big Data; data structures; Big Data management; DataNode; Hadoop; bandwidth-aware data placement; data retrieval time; end-to-end bandwidth; social networking; Bandwidth; Conferences; Data handling; Data storage systems; Distributed databases; Information management; Time measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computational Systems (RAICS), 2013 IEEE Recent Advances in
Conference_Location :
Trivandrum
Print_ISBN :
978-1-4799-2177-5
Type :
conf
DOI :
10.1109/RAICS.2013.6745448
Filename :
6745448
Link To Document :
بازگشت