• DocumentCode
    1415089
  • Title

    Reasoning under Uncertainty for Overlay Fault Diagnosis

  • Author

    Tang, Yongning ; Al-Shaer, Ehab ; Joshi, Kaustubh

  • Author_Institution
    Sch. of Inf. Technol., Illinois State Univ., IL, USA
  • Volume
    9
  • Issue
    1
  • fYear
    2012
  • fDate
    3/1/2012 12:00:00 AM
  • Firstpage
    34
  • Lastpage
    47
  • Abstract
    The performance and reliability of overlay services rely on the underlying overlay network´s ability to effectively diagnose and recover from faults such as link failures and overlay node outages. However, overlay networks bring to fault diagnosis new challenges such as large-scale deployment, inaccessible underlay network information, dynamic symptom-fault causality relationship, and multi-layer complexity. In this paper, we develop an evidential overlay fault diagnosis framework called DigOver to tackle these challenges. Firstly, DigOver identifies a set of potential faulty components based on shared end-user observed negative symptoms. Then, each potential faulty component is evaluated to quantify its fault likelihood and the corresponding evaluation uncertainty. Finally, DigOver dynamically constructs a plausible fault graph to locate the root causes of end-user observed negative symptoms. Both simulation and Internet experiments demonstrate that DigOver can effectively and accurately diagnose overlay faults based on end-user observed negative symptoms.
  • Keywords
    Internet; computational complexity; computer network reliability; fault diagnosis; inference mechanisms; overlay networks; uncertainty handling; DigOver; Internet; dynamic symptom-fault causality relationship; evaluation uncertainty; fault recovery; inaccessible underlay network information; large scale deployment; multilayer complexity; overlay fault diagnosis; overlay network ability; overlay node outage; overlay service reliability; plausible fault graph; potential faulty component; shared end-user observed negative symptom; Cognition; Correlation; Fault diagnosis; Knowledge engineering; Monitoring; Network topology; Uncertainty; Overlay networks; dependable networks; fault diagnosis; uncertainty reasoning;
  • fLanguage
    English
  • Journal_Title
    Network and Service Management, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1932-4537
  • Type

    jour

  • DOI
    10.1109/TNSM.2011.010312.110126
  • Filename
    6122518