• DocumentCode
    3647655
  • Title

    Scalable Failure Management for Peer-to-Peer Networks

  • Author

    Catalin Leordeanu;Vlad Calina;Valentin Cristea

  • fYear
    2012
  • fDate
    7/1/2012 12:00:00 AM
  • Firstpage
    767
  • Lastpage
    774
  • Abstract
    Failure management is a key component in the attempt to provide a reliable environment. This article proposes a solution to increase the reliability of distributed systems based on the Chord Peer-to-Peer overlay. our solution is aimed at providing accurate failure information about the nodes in the system. This is a very difficult task in Peer-to-peer networks due to their dynamic nature and the inability to obtain reliable data from failure detectors. We propose a failure history service used to share failure information between peer-to-peer nodes. This novel service ensures that the information about the current state of a node, as well as its failure history, is as accurate as possible even when facing a large number of node failures. This solution aims to increase the reliability of distributed systems based on the Chord peer-to-peer overlay by providing accurate data which can be used to analyze failures over time.
  • Keywords
    "Peer to peer computing","Detectors","History","Monitoring","Protocols","Software reliability"
  • Publisher
    ieee
  • Conference_Titel
    Complex, Intelligent and Software Intensive Systems (CISIS), 2012 Sixth International Conference on
  • Print_ISBN
    978-1-4673-1233-2
  • Type

    conf

  • DOI
    10.1109/CISIS.2012.193
  • Filename
    6245773