• DocumentCode
    602601
  • Title

    Optimizing Google´s warehouse scale computers: The NUMA experience

  • Author

    Lingjia Tang ; Mars, Jason ; Xiao Zhang ; Hagmann, R. ; Hundt, R. ; Tune, E.

  • Author_Institution
    Univ. of California, San Diego, La Jolla, CA, USA
  • fYear
    2013
  • fDate
    23-27 Feb. 2013
  • Firstpage
    188
  • Lastpage
    197
  • Abstract
    Due to the complexity and the massive scale of modern warehouse scale computers (WSCs), it is challenging to quantify the performance impact of individual microarchitectural properties and the potential optimization benefits in the production environment. As a result of these challenges, there is currently a lack of understanding of the microarchitecture-workload interaction, leaving potentially significant performance on the table. This paper argues for a two-phase performance analysis methodology for optimizing WSCs that combines both an in-production investigation and an experimental load-testing study. To demonstrate the effectiveness of this two-phase approach, and to illustrate the challenges, methodologies and opportunities in optimizing modern WSCs, this paper investigates the impact of non-uniform memory access (NUMA) for several Google´s key web-service workloads in large-scale production WSCs. Leveraging a newly-designed metric and continuous large-scale profiling in live datacenters, our production analysis demonstrates that NUMA has a significant impact (10-20%) on two important web-services: Gmail backend and web-search frontend. Our carefully designed load-test further reveals surprising tradeoffs between optimizing for NUMA performance and reducing cache contention.
  • Keywords
    Web services; Web sites; cache storage; computational complexity; computer centres; data warehouses; optimisation; software architecture; Gmail; Google key Web service workloads; Google warehouse scale computer optimization; NUMA performance; Web-search frontend; cache contention reduction; continuous large-scale profiling; experimental load-testing study; in-production investigation; individual microarchitectural properties; large-scale production WSC; live datacenters; microarchitecture-workload interaction; modern warehouse scale computers; nonuniform memory access; potential optimization benefits; production environment; two-phase performance analysis methodology; Correlation; Google; Measurement; Memory management; Microarchitecture; Production; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computer Architecture (HPCA2013), 2013 IEEE 19th International Symposium on
  • Conference_Location
    Shenzhen
  • ISSN
    1530-0897
  • Print_ISBN
    978-1-4673-5585-8
  • Type

    conf

  • DOI
    10.1109/HPCA.2013.6522318
  • Filename
    6522318