• DocumentCode
    656238
  • Title

    Intralayer Communication for Tree-Based Overlay Networks

  • Author

    Hilbrich, Tobias ; Protze, Joachim ; de Supinski, Bronis R. ; Schulz, Markus ; Muller, Matthias S. ; Nagel, Wolfgang E.

  • Author_Institution
    Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2013
  • fDate
    1-4 Oct. 2013
  • Firstpage
    995
  • Lastpage
    1003
  • Abstract
    While various HPC tools use Tree-Based Overlay Networks (TBONs) to increase their scalability, some use cases do not map well to a tree-based hierarchy. We provide the concept of intralayer communication to improve this situation, where nodes in a specific hierarchy layer may exchange messages directly with each other. This concept targets data preprocessing that allows tool developers to avoid load imbalances in higher hierarchy levels. We implement intralayer communication within the Generic Tools Infrastructure (GTI) that provides TBON services, as well as a high-level abstraction to ease the creation of scalable runtime tools. An extension of GTI´s abstractions allows simple and efficient use of intralayer communication. We demonstrate this capability with a runtime message matching tool for MPI´s point-to-point communication, which we evaluate in an application study with up to 16,384 processes. Low overheads for two benchmark suites show the applicability of our approach, while a stress test demonstrates close to constant overheads across scales. The stress test measurements demonstrate that intralayer communication reduces application slowdown by two orders of magnitude at 2,048 processes, compared to a previous TBON-based implementation.
  • Keywords
    application program interfaces; message passing; parallel processing; trees (mathematics); GTI; HPC tools; MPI; TBONs; data preprocessing; generic tools infrastructure; hierarchy layer; intralayer communication; point-to-point communication; runtime message matching tool; stress test measurements; tree-based overlay networks; Computer crashes; Layout; Protocols; Receivers; Runtime; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2013 42nd International Conference on
  • Conference_Location
    Lyon
  • ISSN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2013.118
  • Filename
    6687443