• DocumentCode
    2395503
  • Title

    Performance analysis for teraflop computers: a distributed automatic approach

  • Author

    Gerndt, Michael ; Schmidt, Andreas ; Schulz, Martin ; Wismüller, Roland

  • Author_Institution
    Inst. fur Inf., Technische Univ. Munchen, Germany
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    23
  • Lastpage
    30
  • Abstract
    Performance analysis for applications on teraflop computers requires a new combination of concepts: online processing, automation, and distribution. The article presents the design of a new analysis system that performs an automatic search for performance problems. This search is guided by a specification of performance properties based on the APART Specification Language. The system is being implemented as a network of analysis agents that are arranged in a hierarchy. Higher level agents search for global performance problems while lower level agents search local performance problems. Leaf agents request and receive performance data from the monitoring library linked to the application. Our online analysis also takes into account design patterns for parallel applications. These patterns make the analysis more effective and the output more application-related. The analysis is currently being implemented for the Hitachi SR8000 teraflop computer at the Leibniz-Rechenzentrum in Munich within the Peridot project
  • Keywords
    multiprocessing systems; performance evaluation; search problems; software agents; specification languages; workstation clusters; 1.3 TFLOPS; APART Specification Language; Hitachi SR8000 teraflop computer; Peridot project; analysis agents; automatic search; design patterns; distributed automatic approach; global performance problems; higher level agents; leaf agents; local performance problems; monitoring library; online analysis; online processing; parallel applications; parallel/distributed systems; performance analysis; performance data; performance modeling; performance problems; performance properties; teraflop computer performance; Application software; Automation; Computer networks; Computerized monitoring; Concurrent computing; Distributed computing; High performance computing; Pattern analysis; Performance analysis; Specification languages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-based Processing, 2002. Proceedings. 10th Euromicro Workshop on
  • Conference_Location
    Canary Islands
  • Print_ISBN
    0-7695-1444-8
  • Type

    conf

  • DOI
    10.1109/EMPDP.2002.994208
  • Filename
    994208