• DocumentCode
    1996884
  • Title

    Performance Implications from Sizing a VM on Multi-core Systems: A Data Analytic Application´s View

  • Author

    Seung-Hwan Lim ; Horey, James ; Yanjun Yao ; Begoli, Edmon ; Qing Cao

  • Author_Institution
    Oak Ridge Nat. Lab., Oak Ridge, TN, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1001
  • Lastpage
    1008
  • Abstract
    In this paper, we present a quantitative performance analysis of data analytics applications running on multi-core virtual machines. Such environments form the core of cloud computing. In addition, data analytics applications, such as Cassandra and Hadoop, are becoming increasingly popular on cloud computing platforms. This convergence necessitates a better understanding of the performance and cost implications of such hybrid systems. For example, the very first step in hosting applications in virtualized environments, requires the user to configure the number of virtual processors and the size of memory. To understand performance implications of this step, we benchmarked three Yahoo Cloud Serving Benchmark(YCSB) workloads in a virtualized multi-core environment. Our measurements indicate that the performance of Cassandra for YCSB workloads does not heavily depend on the processing capacity of a system, while the size of the data set is critical to performance relative to allocated memory. We also identified a strong relationship between the running time of workloads and various hardware events (last level cache loads, misses, and CPU migrations). From this analysis, we provide several suggestions to improve the performance of data analytics applications running on cloud computing environments.
  • Keywords
    cloud computing; data analysis; multiprocessing systems; virtual machines; VM; YCSB workloads; Yahoo cloud serving benchmark; cloud computing environments; data analytic application; multicore virtual machines; virtualized multicore environment; Cloud computing; Hardware; Kernel; Program processors; Regression analysis; Virtual machine monitors; Virtual machining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.97
  • Filename
    6650984