• DocumentCode
    2500005
  • Title

    Hierarchical adaptive distributed system-level diagnosis applied for SNMP-based network fault management

  • Author

    Duarte, Elias Procópio, Jr. ; Nanya, Takashi

  • Author_Institution
    Tokyo Inst. of Technol., Japan
  • fYear
    1996
  • fDate
    23-25 Oct 1996
  • Firstpage
    98
  • Lastpage
    107
  • Abstract
    Fault management is a key functional area of network management systems, but currently deployed applications often implement rudimentary diagnosis mechanisms. This paper presents a new hierarchical adaptive distributed system-level diagnosis (Hi-ADSD) algorithm and its implementation based on SNMP (simple network management protocol). Hi-ADSD is a fully distributed algorithm that has diagnosis latency of at most (log2N)2 testing rounds for a network of N nodes. Nodes are mapped into progressively larger logical clusters, so that each node executes tests in a hierarchical fashion. The algorithm assumes no link faults, a fully-connected network and imposes no bounds on the number of faults. Both the worst-case diagnosis latency and correctness of the algorithm are formally proved. Experimental results are given through simulation of the algorithm for large networks. The algorithm was implemented on a small network using SNMP. We present details of the implementation, including device fault management, the role of the network management station, and the diagnosis management information base
  • Keywords
    adaptive systems; computer network management; computer network reliability; distributed algorithms; fault diagnosis; hierarchical systems; local area networks; protocols; Hi-ADSD algorithm; LAN; distributed algorithm; hierarchical adaptive distributed system-level diagnosis; local area network; network fault management; network nodes; simple network management protocol; worst-case diagnosis latency; Adaptive systems; Clustering algorithms; Computer network management; Delay; Distributed algorithms; Fault diagnosis; Monitoring; Protocols; Technology management; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 1996. Proceedings., 15th Symposium on
  • Conference_Location
    Nigara-on-the-Lake, Ont.
  • ISSN
    1060-9857
  • Print_ISBN
    0-8186-7481-4
  • Type

    conf

  • DOI
    10.1109/RELDIS.1996.559703
  • Filename
    559703