  • DocumentCode
    2666973
  • Title
    Dynamic and fault-tolerant cluster management
  • Author
    Gidenstam, Anders ; Koldehofe, Boris ; Papatriantafilou, Marina ; Tsigas, Philippas
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Chalmers Univ. of Technol., Goteborg, Sweden
  • fYear
    2005
  • fDate
    31 Aug.-2 Sept. 2005
  • Firstpage
    237
  • Lastpage
    244
  • Abstract
    Recent decentralised event-based systems have focused on providing event delivery that scales with an increasing number of processes. While the main focus of research has been on ensuring that processes maintain only a small amount of membership and routing information, an important factor in achieving scalability for event-based peer-to-peer dissemination systems is the number of events disseminated at the same time. This work presents a dynamic and fault-tolerant cluster management method which can be used to coordinate concurrent access to resources in a peer-to-peer system. In the context of event-based dissemination systems, the cluster management can be used to control the number of concurrently disseminated events. We present and analyse an algorithm implementing the proposed cluster management model in a fault-tolerant and decentralised way. The algorithm provides each cluster with a limited set of tickets. A process which has obtained a ticket may send events corresponding to the resources of the cluster. The algorithm guarantees that no two processes ever issue an event corresponding to the same ticket at the same time. The cluster management model on its own has interesting properties which can be useful for many peer-to-peer applications.
  • Keywords
    computer network management; fault tolerant computing; peer-to-peer computing; workstation clusters; concurrent access; decentralised event-based system; dynamic cluster management; event delivery; event-based peer-to-peer dissemination system; fault-tolerant cluster management; peer-to-peer application; Algorithm design and analysis; Clustering algorithms; Collaboration; Control systems; Fault tolerance; Fault tolerant systems; Peer to peer computing; Resource management; Routing; Scalability;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Peer-to-Peer Computing, 2005. P2P 2005. Fifth IEEE International Conference on
  • Print_ISBN
    0-7695-2376-5
  • Type
    conf
  • DOI
    10.1109/P2P.2005.6
  • Filename
    1551046