DocumentCode
1996884
Title
Performance Implications from Sizing a VM on Multi-core Systems: A Data Analytic Application´s View
Author
Seung-Hwan Lim ; Horey, James ; Yanjun Yao ; Begoli, Edmon ; Qing Cao
Author_Institution
Oak Ridge Nat. Lab., Oak Ridge, TN, USA
fYear
2013
fDate
20-24 May 2013
Firstpage
1001
Lastpage
1008
Abstract
In this paper, we present a quantitative performance analysis of data analytics applications running on multi-core virtual machines. Such environments form the core of cloud computing. In addition, data analytics applications, such as Cassandra and Hadoop, are becoming increasingly popular on cloud computing platforms. This convergence necessitates a better understanding of the performance and cost implications of such hybrid systems. For example, the very first step in hosting applications in virtualized environments, requires the user to configure the number of virtual processors and the size of memory. To understand performance implications of this step, we benchmarked three Yahoo Cloud Serving Benchmark(YCSB) workloads in a virtualized multi-core environment. Our measurements indicate that the performance of Cassandra for YCSB workloads does not heavily depend on the processing capacity of a system, while the size of the data set is critical to performance relative to allocated memory. We also identified a strong relationship between the running time of workloads and various hardware events (last level cache loads, misses, and CPU migrations). From this analysis, we provide several suggestions to improve the performance of data analytics applications running on cloud computing environments.
Keywords
cloud computing; data analysis; multiprocessing systems; virtual machines; VM; YCSB workloads; Yahoo cloud serving benchmark; cloud computing environments; data analytic application; multicore virtual machines; virtualized multicore environment; Cloud computing; Hardware; Kernel; Program processors; Regression analysis; Virtual machine monitors; Virtual machining;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location
Cambridge, MA
Print_ISBN
978-0-7695-4979-8
Type
conf
DOI
10.1109/IPDPSW.2013.97
Filename
6650984
Link To Document