Title :
Selective commitment and selective margin: Techniques to minimize cost in an IaaS cloud
Author :
Hong, Yu-Ju ; Xue, Jiachen ; Thottethodi, Mithuna
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
Abstract :
Cloud computing holds the exciting potential of elastically scaling computation to match time-varying demand, thus eliminating the need to provision for peak demand. However, the uncertainty of variable loads necessitate the use of margins - servers that must be held active to absorb unpredictable potential load bursts - which can be a significant fraction of overall cost. Further, naively switching to an on-demand cloud model can actually degrade true costs (server costs that would be incurred even if margin costs disappeared) because of the fundamental economic rule wherein on-demand services/goods cost more compared to reserved services/goods where the user bears some commitment. On-demand customers pay a premium in exchange for not undertaking the fixed-cost risk that committed customers undertake. This paper addresses the twin challenges of minimizing margin costs and true costs in an Infrastructure-as-a-Service (IaaS) cloud. Our paper makes the following two contributions. First, rather than use a fixed margin, we observe that the margin may be selectively used depending on load levels. Based on the above observation, we develop ShrinkWrap-opt which is a dynamic programming algorithm that achieves optimal margin cost while satisfying the desired (statistical) response time guarantees. Second, we propose commitment straddling - the selective use of some reserved machines in conjunction with on-demand machines - to achieve optimal true-cost. Simulations with real Web server load traces using the Amazon EC2 cost model reveal that our techniques save between 13% and 29% (21% on average) in cost while satisfying response-time targets.
Keywords :
cloud computing; cost reduction; dynamic programming; Amazon EC2 cost model; IaaS cloud; ShrinkWrap; Web server load trace; cloud computing; cost minimization; dynamic programming algorithm; elastically scaling computation potential; fixed-cost risk; infrastructure-as-a-service; load burst; margin cost; on-demand cloud model; on-demand customer; peak demand provision; response-time target; selective commitment; selective margin; server cost; Dynamic programming; Equations; Load modeling; Nickel; Servers; Time factors; Vectors;
Conference_Titel :
Performance Analysis of Systems and Software (ISPASS), 2012 IEEE International Symposium on
Conference_Location :
New Brunswick, NJ
Print_ISBN :
978-1-4673-1143-4
Electronic_ISBN :
978-1-4673-1145-8
DOI :
10.1109/ISPASS.2012.6189210