Title :
Optimizing Google´s warehouse scale computers: The NUMA experience
Author :
Lingjia Tang ; Mars, Jason ; Xiao Zhang ; Hagmann, R. ; Hundt, R. ; Tune, E.
Author_Institution :
Univ. of California, San Diego, La Jolla, CA, USA
Abstract :
Due to the complexity and the massive scale of modern warehouse scale computers (WSCs), it is challenging to quantify the performance impact of individual microarchitectural properties and the potential optimization benefits in the production environment. As a result of these challenges, there is currently a lack of understanding of the microarchitecture-workload interaction, leaving potentially significant performance on the table. This paper argues for a two-phase performance analysis methodology for optimizing WSCs that combines both an in-production investigation and an experimental load-testing study. To demonstrate the effectiveness of this two-phase approach, and to illustrate the challenges, methodologies and opportunities in optimizing modern WSCs, this paper investigates the impact of non-uniform memory access (NUMA) for several Google´s key web-service workloads in large-scale production WSCs. Leveraging a newly-designed metric and continuous large-scale profiling in live datacenters, our production analysis demonstrates that NUMA has a significant impact (10-20%) on two important web-services: Gmail backend and web-search frontend. Our carefully designed load-test further reveals surprising tradeoffs between optimizing for NUMA performance and reducing cache contention.
Keywords :
Web services; Web sites; cache storage; computational complexity; computer centres; data warehouses; optimisation; software architecture; Gmail; Google key Web service workloads; Google warehouse scale computer optimization; NUMA performance; Web-search frontend; cache contention reduction; continuous large-scale profiling; experimental load-testing study; in-production investigation; individual microarchitectural properties; large-scale production WSC; live datacenters; microarchitecture-workload interaction; modern warehouse scale computers; nonuniform memory access; potential optimization benefits; production environment; two-phase performance analysis methodology; Correlation; Google; Measurement; Memory management; Microarchitecture; Production; Servers;
Conference_Titel :
High Performance Computer Architecture (HPCA2013), 2013 IEEE 19th International Symposium on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4673-5585-8
DOI :
10.1109/HPCA.2013.6522318