DocumentCode
683519
Title
Bandwidth-aware data placement scheme for Hadoop
Author
Shabeera, T.P. ; Madhu Kumar, S.D.
Author_Institution
Nat. Inst. of Technol. Calicut, Calicut, India
fYear
2013
fDate
19-21 Dec. 2013
Firstpage
64
Lastpage
67
Abstract
We are living in a data rich era. The size of the data is increasing exponentially. Social networking applications, Scientific experiments, etc. are the major contributors of Big Data. The data can be structured, semi-structured or unstructured. Big Data management solutions can be implemented in-house in the organization or it can be stored in cloud. Whether it is stored in-house or in cloud, the placement of data is very important. In general, users demand the availability of data whenever they request for it. There are many parameters that effect the data retrieval time in Hadoop. Among them, this paper pays attention to the available bandwidth. To minimize the data retrieval time, the data must be placed in a DataNode which has the maximum bandwidth. We have proposed a solution for bandwidth-aware data placement in Hadoop by periodically measuring the bandwidth between clients and DataNodes and placing the data blocks in DataNodes that have maximum end-to-end bandwidth.
Keywords
Big Data; data structures; Big Data management; DataNode; Hadoop; bandwidth-aware data placement; data retrieval time; end-to-end bandwidth; social networking; Bandwidth; Conferences; Data handling; Data storage systems; Distributed databases; Information management; Time measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Computational Systems (RAICS), 2013 IEEE Recent Advances in
Conference_Location
Trivandrum
Print_ISBN
978-1-4799-2177-5
Type
conf
DOI
10.1109/RAICS.2013.6745448
Filename
6745448
Link To Document