• DocumentCode
    2875235
  • Title

    Community-base Fault Diagnosis Using Incremental Belief Revision

  • Author

    Tang, Yongning ; Cheng, Guang ; Xu, Zhiwei ; Al-Shaer, Ehab

  • Author_Institution
    Illinois State Univ., Normal, IL, USA
  • fYear
    2009
  • fDate
    9-11 July 2009
  • Firstpage
    121
  • Lastpage
    128
  • Abstract
    Overlay networks have emerged as a powerful and flexible platform for developing new disruptive network applications. The attractive characteristics of overlay networks such as planetary-scale distributions, user-level flexibility (e.g., overlay routing) and manageability bring to overlay fault diagnosis new challenges, which include inaccessible underlying network information, incomplete and inaccurate network status observations; dynamic symptom-fault causality relationships, and multi-layer complexity. To address these challenges, we propose a distributed user-level Belief Revision based overlay fault diagnosis technique called EUDiag. EUDiag can passively use observed overlay symptoms as reported by overlay monitoring agents to correlate and diagnose faults, and select the least-costly appropriate probing actions whenever necessary to enhance the passive fault reasoning results. EUDiag adapts to the changes in highly dynamic overlay networks by incrementally revising user beliefs based on new observed overlay symptoms. EUDiag can diagnose faults without relying on underlying network fault probabilistic quantifications (e.g. prior fault probability).Simulations and experimental studies show that EUDiag can efficiently (e.g. low latency) and accurately localize root causes of overlay faults/problems, even when the observed symptoms are incomplete.
  • Keywords
    distributed programming; fault diagnosis; probability; user interfaces; EUDiag; community-base fault diagnosis; distributed user-level belief revision; incremental belief revision; network fault probabilistic quantifications; overlay fault diagnosis; overlay networks; Delay; Fault detection; Fault diagnosis; Monitoring; Network topology; Routing; Telecommunication network reliability; USA Councils; belief revision; fault diagnosis; overlay network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networking, Architecture, and Storage, 2009. NAS 2009. IEEE International Conference on
  • Conference_Location
    Hunan
  • Print_ISBN
    978-0-7695-3741-2
  • Type

    conf

  • DOI
    10.1109/NAS.2009.24
  • Filename
    5197308