• DocumentCode
    1914030
  • Title

    End-to-end service failure diagnosis using belief networks

  • Author

    Steinder, M. ; Sethi, A.S.

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Delaware Univ., Newark, DE, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    375
  • Lastpage
    390
  • Abstract
    We present fault localization techniques suitable for diagnosing end-to-end service problems in communication systems with complex topologies. We refine a layered system model that represents relationships between services and functions offered between neighboring protocol layers. In a given layer, an end-to-end service between two hosts may be provided using multiple host-to-host services offered in this layer between two hosts on the end-to-end path. Relationships among end-to-end and host-to-host services form a bipartite probabilistic dependency graph whose structure depends on the network topology in the corresponding protocol layer. When an end-to-end service fails or experiences performance problems it is important to efficiently find the responsible host-to-host services. Finding the most probable explanation (MPE) of the observed symptoms is NP-hard. We propose two fault localization techniques based on Pearl´s (1988) iterative algorithms for singly connected belief networks. The probabilistic dependency graph is transformed into a belief network, and then the approximations based on Pearl´s algorithms and exact bucket tree elimination algorithm are designed and evaluated through extensive simulation study.
  • Keywords
    belief networks; fault location; graph theory; iterative methods; network topology; probability; protocols; telecommunication network management; telecommunication network reliability; Pearl´s iterative algorithms; QoS guarantees; belief networks; bipartite probabilistic dependency graph; end-to-end service failure diagnosis; exact bucket tree elimination algorithm; fault localization techniques; host-to-host services; layered dependency graph; layered system model; most probable explanation; network availability; network fault management; network topology; nondeterministic reasoning; protocol layers; simulation; Availability; Collaboration; Computer networks; Fault diagnosis; Inference algorithms; Iterative algorithms; Network topology; Protocols; Quality of service; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Operations and Management Symposium, 2002. NOMS 2002. 2002 IEEE/IFIP
  • Print_ISBN
    0-7803-7382-0
  • Type

    conf

  • DOI
    10.1109/NOMS.2002.1015595
  • Filename
    1015595