DocumentCode
720551
Title
An Availability-on-Demand Mechanism for Datacenters
Author
Shen, Siqi ; Iosup, Alexandru ; Israel, Assaf ; Cirne, Walfredo ; Raz, Danny ; Epema, Dick
Author_Institution
Delft Univ. of Technol., Delft, Netherlands
fYear
2015
fDate
4-7 May 2015
Firstpage
495
Lastpage
504
Abstract
Datacenters are at the core of a wide variety of daily ICT utilities, ranging from scientific computing to online gaming. Due to the scale of today's datacenters, the failure of computing resources is a common occurrence that may disrupt the availability of ICT services, leading to revenue loss. Although many high availability (HA) techniques have been proposed to mask resource failures, datacenter users, who rent datacenter resources and use them to provide ICT utilities to a global population, still have limited management options for dynamically selecting and configuring HA techniques. In this work, we propose Availability-on-Demand (AoD), a mechanism consisting of an API that allows datacenter users to specify availability requirements which can dynamically change, and an availability-aware scheduler that dynamically manages computing resources based on user-specified requirements. The mechanism operates at the level of individual service instances, thus enabling fine-grained control of availability, for example during sudden requirement changes and periodic operations. Through realistic, trace-based simulations, we show that the AoD mechanism can achieve high availability at low cost. The AoD approach consumes about the same number of CPU hours as approaches that apply HA techniques randomly, while achieving higher availability. Moreover, compared to an ideal approach that has perfect predictions of failures, it consumes 13% to 31% more CPU hours but achieves similar availability for critical parts of applications.
Keywords
application program interfaces; computer centres; reliability; resource allocation; API; AoD mechanism; HA techniques; availability-aware scheduler; availability-on-demand mechanism; computing resource management; datacenters; Computational modeling; Dynamic scheduling; Games; Heuristic algorithms; Processor scheduling; Resource management; Turning; Availability on demand; cloud computing; datacenter;
fLanguage
English
Publisher
IEEE
Conference_Titel
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location
Shenzhen
Type
conf
DOI
10.1109/CCGrid.2015.58
Filename
7152515