  • DocumentCode
    413107
  • Title
    Identifying performance bottlenecks on modern microarchitectures using an adaptable probe
  • Author
    Griem, Gorden ; Oliker, Leonid ; Shalf, John ; Yelick, Katherine

  • Author_Institution
    Lawrence Berkeley Nat. Lab., CA, USA
  • fYear
    2004
  • fDate
    26-30 April 2004
  • Firstpage
    255
  • Abstract
    Summary form only given. The gap between peak and delivered performance for scientific applications running on microprocessor-based systems has grown considerably in recent years. The inability to achieve the desired performance even on a single processor is often attributed to an inadequate memory system, but without identification or quantification of a specific bottleneck. In this work, we use an adaptable synthetic benchmark to isolate application characteristics that cause a significant drop in performance, giving application programmers and architects information about possible optimizations. Our adaptable probe, called sqmat, uses only four parameters to capture key characteristics of scientific workloads: working-set size, computational intensity, indirection, and irregularity. This paper describes the implementation of sqmat and uses its tunable parameters to evaluate four leading 64-bit microprocessors that are popular building blocks for current high performance systems: Intel Itanium2, AMD Opteron, IBM Power3, and IBM Power4.
  • Keywords
    computer architecture; natural sciences computing; performance evaluation; AMD Opteron; IBM Power3; IBM Power4; Intel Itanium2; adaptable probe; adaptable synthetic benchmark; high performance systems; microarchitectures; microprocessor-based systems; performance bottlenecks; scientific applications; sqmat; Cyclotrons; Distributed processing; Equations; Laboratories; Linear algebra; Microarchitecture; Microprocessors; Probes; Registers; Supercomputers
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
  • Print_ISBN
    0-7695-2132-0
  • Type
    conf
  • DOI
    10.1109/IPDPS.2004.1303320
  • Filename
    1303320