DocumentCode
611083
Title
DUAL: Reliability-Aware Power Management in Data Centers
Author
Xin Xu ; Teramoto, Kenbu ; Morales, Aythami ; Huang, He Helen
Author_Institution
George Washington Univ., Washington, DC, USA
fYear
2013
fDate
13-16 May 2013
Firstpage
530
Lastpage
537
Abstract
A virtualized data center hosts users and applications within a large number of virtual machines (VM) to achieve easy provisioning and high utilization of physical resources. Energy efficiency and reliability are two primary concerns for operating a data center. Power saving techniques, such as dynamic voltage and frequency scaling (DVFS), are often employed to reduce the supply voltages of the CPUs in runtime when the computer system utilization is low. However, DVFS can potentially decrease the system reliability - the processors at low voltages are more likely to encounter soft errors that may result in VM or system crashes. In this work, we propose a data center management framework, DUAL, which consists of the new virtual machine power and reliability analysis tools. The framework is designed to balance the dual needs of a data center: reducing energy consumption and providing high reliability. The evaluations show that DUAL can help maintain the desired reliability and significantly reduce power consumption, which in turn will lower the overall operational cost of a data center.
Keywords
building management systems; computer centres; energy conservation; energy management systems; power aware computing; power consumption; reliability; resource allocation; system recovery; virtual machines; CPU; DVFS; VM crash; computer system utilization; data center management framework; dynamic voltage and frequency scaling; energy consumption; energy efficiency; physical resource provisioning; physical resource utilization; power consumption; power saving; reliability analysis tool; reliability-aware power management; soft error; supply voltage; system crash; system reliability; system runtime; virtual machine power; virtualized data center; Adaptation models; Computer crashes; Mathematical model; Power demand; Program processors; Reliability; Servers;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on
Conference_Location
Delft
Print_ISBN
978-1-4673-6465-2
Type
conf
DOI
10.1109/CCGrid.2013.82
Filename
6546135
Link To Document