DocumentCode :
3681234
Title :
HDFS Heterogeneous Storage Resource Management Based on Data Temperature
Author :
Rohith Subramanyam
Author_Institution :
Dept. of Comput. Sci., Univ. of Wisconsin-Madison Madison, Madison, WI, USA
fYear :
2015
Firstpage :
232
Lastpage :
235
Abstract :
Hadoop has traditionally been used as a large-scale batch processing system. However, interactive applications such as Facebook Messenger are becoming increasingly prominent in the Hadoop world. A key bottleneck in adapting Hadoop to real-time processing is disk data transfer rate. The advent of Solid State Drives (SSDs) holds great promise in this regard as they provide bandwidth on the orders of magnitude better than that of rotating disks. But due to their higher cost per gigabyte, a common approach is to have heterogeneous storage types. This paper presents a Storage Resource Management technique that automatically and dynamically moves data across this tiered storage based on Data Temperature, migrating "hot" data towards faster storage and "cold" data towards inexpensive archival storage. Thus, the cluster adapts based on the characteristics of the workloads over time to make effective use of the scarce expensive storage. Finally, I evaluate my modified version of the Hadoop Distributed File System (HDFS) against the vanilla version to compare their performances. The results are promising and show an improvement in both read and write performance with a significant improvement in read performance.
Keywords :
"Temperature distribution","Throughput","Media","Random access memory","Generators","Resource management"
Publisher :
ieee
Conference_Titel :
Cloud and Autonomic Computing (ICCAC), 2015 International Conference on
Type :
conf
DOI :
10.1109/ICCAC.2015.33
Filename :
7312163
Link To Document :
بازگشت