• DocumentCode
    379070
  • Title

    Scalable monitoring and configuration tools for grids and clusters

  • Author

    Augerat, Philippe ; Martin, Cyrill ; Stein, Benhur

  • Author_Institution
    Inst. Nat. Polytech. de Grenoble, France
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    147
  • Lastpage
    153
  • Abstract
    We present the Ka-admin project that addresses the problem of collecting, visualizing and feeding back any grid information, trace or snapshot, compliant to an XML-like model. Real use includes performance analysis of parallel applications and cluster administration. Ka-admin includes a generic "filter" module that processes monitored data independently of what they represent. Filters can remove, aggregate and transform data or pass them to external applications. The end user is responsible for activating the filters from within an interactive graphical interface. This allows him to focus on important information. We also present a "scatter/gather" module that allows efficient collection and distribution of data and commands in a large cluster. Early work on "MPI/threads" applications and system monitoring tools proved that the combination of both modules matches the objective of a scalable visualization of large data sets
  • Keywords
    application program interfaces; message passing; performance evaluation; system monitoring; workstation clusters; Ka-admin project; MPI/threads applications; XML-like model; cluster administration; clusters; filter module; grid information collection; grid information visualization; grids; interactive graphical interface; large data sets; parallel applications; performance analysis; scalable configuration tools; scalable monitoring tools; scalable visualization; scatter/gather module; snapshot information; trace information; Aggregates; Application software; Computer architecture; Computerized monitoring; Data visualization; Filters; High performance computing; Laboratories; Performance analysis; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-based Processing, 2002. Proceedings. 10th Euromicro Workshop on
  • Conference_Location
    Canary Islands
  • Print_ISBN
    0-7695-1444-8
  • Type

    conf

  • DOI
    10.1109/EMPDP.2002.994255
  • Filename
    994255