• DocumentCode
    1153184
  • Title

    Feedback-based synchronization in system area networks for cluster computing

  • Author

    Song, Hyo Jung ; Chien, Andrew A.

  • Author_Institution
    Samsung Electron., Seoul, South Korea
  • Volume
    16
  • Issue
    10
  • fYear
    2005
  • Firstpage
    908
  • Lastpage
    920
  • Abstract
    Many applications in cluster computing require QoS (quality of service) services. Since performance predictability is essential to provide QoS service, underlying systems must provide predictable performance guarantees. One way to ensure such guarantees from network subsystems is to generate global schedules from applications´ network requests and to execute the local portion of the schedules at each network interface. To ensure accurate execution of the schedules, it is essential that a global time base must be maintained by local clocks at each network interface. The task of providing a single time base is called a synchronization problem and this paper addresses the problem for system area networks. To solve the synchronization problem, FM-QoS [K. Connelly (1999)] proposed a simple synchronization mechanism called FBS (feedback-based synchronization) which uses built-in flow control signals. This paper extends the basic notion of FM-QoS to a theoretical framework and generalizes it: 1) to identify a set of built-in network flow control signals for synchrony and to formalize it as a synchronizing schedule and 2) to analyze the synchronization precision of FBS in terms of flow control parameters. Based on generalization, two application classes are studied for a single switch network and a multiple switch network. For each class, a synchronizing schedule is proposed and its bounded skew is analyzed. Unlike FM-QoS, the synchronizing schedule is proven to minimize the bounded skew value for a single switch network. To understand the analysis results in practical networks, skew values are obtained with flow control parameters of Myrinet-2000. We observed that the maximum bounded skew of FBS is 5.79μsec or less over all our experiments. Based on this result, we came to a conclusion that FBS was a feasible synchronization mechanism in system area networks.
  • Keywords
    network interfaces; quality of service; switching networks; synchronisation; telecommunication congestion control; workstation clusters; QoS; bounded skew; cluster computing; feedback-based synchronization; link level flow control; network flow control signal; network interface; quality of service; switching networks; system area network; Clocks; Computer applications; Computer networks; Network interfaces; Processor scheduling; Quality of service; Signal analysis; Signal processing; Switches; Synchronization; Synchronization; cluster computing.; link level flow control; system area networks;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2005.122
  • Filename
    1501803