مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimizing virtual machine consolidation performance on NUMA server architecture for cloud workloads

DocumentCode :

177336

Title :

Optimizing virtual machine consolidation performance on NUMA server architecture for cloud workloads

Author :

Ming Liu ; Tao Li

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Florida, Gainesville, FL, USA

fYear :

2014

fDate :

14-18 June 2014

Firstpage :

325

Lastpage :

336

Abstract :

Server virtualization and workload consolidation enable multiple workloads to share a single physical server, resulting in significant energy savings and utilization improvements. The shift of physical server architectures to NUMA and the increasing popularity of scale-out cloud applications undermine workload consolidation efficiency and result in overall system degradation. In this work, we characterize the consolidation of cloud workloads on NUMA virtualized systems, estimate four different sources of architecture overhead, and explore optimization opportunities beyond the default NUMA-aware hypervisor memory management. Motivated by the observed architectural impact on cloud workload consolidation performance, we propose three optimization techniques incorporating NUMA access overhead into the hypervisor´s virtual machine memory allocation and page fault handling routines. Among these, estimation of the memory zone access overhead serves as a foundation for the other two techniques: a NUMA overhead aware buddy allocator and a P2M swap FIFO. Cache hit rate, cycle loss due to cache miss, and IPC serve as indicators to estimate the access cost of each memory node. Our optimized buddy allocator dynamically selects low-overhead memory zones and “proportionally” distributes memory pages across target nodes. The P2M swap FIFO records recently unused <;PFN, MFN> lists for mapping exchanges to rebalance memory access pressure within one domain. Our real system based evaluations show a 41.1% performance improvement when consolidating 16-VMs on a 4-socket server (the proposed allocator contributes 22.8% of the performance gain and the P2M swap FIFO accounts for the rest). Furthermore, our techniques can cooperate well with other methods (i.e. vCPU migration) and scale well when varying VM memory size and the number of sockets in a physical host.

Keywords :

cache storage; cloud computing; memory architecture; optimisation; storage management; virtual machines; IPC; NUMA server architecture; NUMA-aware hypervisor memory management; P2M swap FIFO; buddy allocator; cache hit rate; cloud workload consolidation; cycle loss; energy savings; memory zone access overhead; optimization technique; page fault handling routines; scale-out cloud application; server virtualization; vCPU migration; virtual machine consolidation; virtual machine memory allocation; Memory management; Optimization; Radiation detectors; Servers; Sockets; Virtual machine monitors; Virtual machining;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Architecture (ISCA), 2014 ACM/IEEE 41st International Symposium on

Conference_Location :

Minneapolis, MN

Print_ISBN :

978-1-4799-4396-8

Type :

conf

DOI :

10.1109/ISCA.2014.6853224

Filename :

6853224

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=177336