DocumentCode
2865782
Title
An Evaluation of Power-Proportional Data Placement for Hadoop Distributed File Systems
Author
Le, Hieu Hanh ; Hikida, Satoshi ; Yokota, Haruo
Author_Institution
Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan
fYear
2011
fDate
12-14 Dec. 2011
Firstpage
752
Lastpage
759
Abstract
Power-saving storage in data centers is now gaining much interest because of the increase in power requirements. In particular, with the expansion of services requiring distributed-processing frameworks, power proportionality in power-aware file systems is attracting great attention from academia and industry. The concept of power proportionality is that a system should perform work in proportion to the energy it consumes. To provide this important characteristic, data placement methods that enable a system to operate in multiple gears, each containing a different number of active nodes, have been proposed. Of these, RABBIT, with its power proportional data placement, is a novel method that is implemented over a Hadoop Distributed File System (HDFS) and has been shown to be successful in guaranteeing power proportionality for read only tasks. However, the data layout used in RABBIT does not consider write access occurring when the system operates in low gear and there are inactive nodes. In this paper, to discover an appropriate approach for power-efficient distributed file systems, we evaluate the performance of the RABBIT and PARAID methods on read-only tasks and the cost to system performance of write access in low gear. The skewed data placement in PARAID, which was one of the first methods to suggest power proportionality in disk-based storage systems, is supposed to offer good performance when dealing with a frequently updated dataset.
Keywords
computer centres; disc storage; distributed databases; energy conservation; Hadoop distributed file systems; PARAID; RABBIT; data centers; disk based storage systems; distributed processing frameworks; power efficient distributed file systems; power proportional data placement; power requirements; power saving storage; Distributed databases; Gears; Layout; Power demand; Rabbits; Servers; Writing; HDFS; PARAID; RABBIT; cloud computing; distributed file system; energy aware; power consumption aware; power proportionality;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable, Autonomic and Secure Computing (DASC), 2011 IEEE Ninth International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-1-4673-0006-3
Type
conf
DOI
10.1109/DASC.2011.129
Filename
6118907
Link To Document