• DocumentCode
    3074565
  • Title

    Evaluation and optimization of multicore performance bottlenecks in supercomputing applications

  • Author

    Diamond, Jeff ; Burtscher, Martin ; McCalpin, John D. ; Kim, Byoung-Do ; Keckler, Stephen W. ; Browne, James C.

  • Author_Institution
    Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2011
  • fDate
    10-12 April 2011
  • Firstpage
    32
  • Lastpage
    43
  • Abstract
    The computation nodes of modern supercomputers commonly consist of multiple multicore processors. To maximize the performance of such systems requires measurement, analysis, and optimization techniques that specifically target multicore environments. This paper first examines traditional unicore metrics and demonstrates how they can be misleading in a multicore system. Second, it examines and characterizes performance bottlenecks specific to multicore-based systems. Third, it describes performance measurement challenges that arise in multicore systems and outlines methods for extracting sound measurements that lead to performance optimization opportunities. The measurement and analysis process is based on a case study of the HOMME atmospheric modeling benchmark code from NCAR running on supercomputers built upon AMD Barcelona and Intel Nehalem quad-core processors. Applying the multicore bottleneck analysis to HOMME led to multicore aware source-code optimizations that increased performance by up to 35%. While the case studies were carried out on multichip nodes of supercomputers using an HPC application as the target for optimization, the pitfalls identified and the insights obtained should apply to any system that is composed of multicore processors.
  • Keywords
    mainframes; microprocessor chips; multiprocessing systems; performance evaluation; source coding; AMD Barcelona; HOMME atmospheric modeling benchmark code; HPC application; Intel Nehalem quadcore processor; NCAR; multichip nodes; multicore aware source code optimization; multicore performance bottlenecks; multicore processors; performance measurement; performance optimization; sound measurement; supercomputing; unicore metrics; Atmospheric modeling; Multicore processing; Optimization; Program processors; Scalability; Semiconductor device measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Performance Analysis of Systems and Software (ISPASS), 2011 IEEE International Symposium on
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-61284-367-4
  • Electronic_ISBN
    978-1-61284-368-1
  • Type

    conf

  • DOI
    10.1109/ISPASS.2011.5762713
  • Filename
    5762713