• DocumentCode
    228654
  • Title

    Dissecting On-Node Memory Access Performance: A Semantic Approach

  • Author

    Gimenez, Alfredo ; Gamblin, Todd ; Rountree, Barry ; Bhatele, Abhinav ; Jusufi, Ilir ; Bremer, Peer-Timo ; Hamann, Bernd

  • Author_Institution
    Dept. of Comput. Sci., Univ. of California, Davis, Davis, CA, USA
  • fYear
    2014
  • fDate
    16-21 Nov. 2014
  • Firstpage
    166
  • Lastpage
    176
  • Abstract
    Optimizing memory access is critical for performance and power efficiency. CPU manufacturers have developed sampling-based performance measurement units (PMUs) that report precise costs of memory accesses at specific addresses. However, this data is too low-level to be meaningfully interpreted and contains an excessive amount of irrelevant or uninteresting information. We have developed a method to gather fine-grained memory access performance data for specific data objects and regions of code with low overhead and attribute semantic information to the sampled memory accesses. This information provides the context necessary to more effectively interpret the data. We have developed a tool that performs this sampling and attribution and used the tool to discover and diagnose performance problems in real-world applications. Our techniques provide useful insight into the memory behaviour of applications and allow programmers to understand the performance ramifications of key design decisions: domain decomposition, multi-threading, and data motion within distributed memory systems.
  • Keywords
    distributed memory systems; multi-threading; storage management; CPU manufacturers; PMU; attribute semantic information; code regions; data motion; data objects; design decisions; distributed memory systems; domain decomposition; fine-grained memory access performance data; memory access optimization; memory behaviour; multithreading; on-node memory access performance; performance ramifications; power efficiency; sampled memory accesses; sampling-based performance measurement units; semantic approach; Context; Hardware; Kernel; Libraries; Program processors; Semantics; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
  • Conference_Location
    New Orleans, LA
  • Print_ISBN
    978-1-4799-5499-5
  • Type

    conf

  • DOI
    10.1109/SC.2014.19
  • Filename
    7013001