• DocumentCode
    3609611
  • Title

    DisCaRia—Distributed Case-Based Reasoning System for Fault Management

  • Author

    Ha Manh Tran ; Schonwalder, Jurgen

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Int. Univ.-Vietnam Nat. Univ., Ho Chi Minh City, Vietnam
  • Volume
    12
  • Issue
    4
  • fYear
    2015
  • Firstpage
    540
  • Lastpage
    553
  • Abstract
    Fault resolution in communication networks and distributed systems is a challenge that demands the expertise of system administrators and the support of multiple systems, such as monitoring and event correlation systems. Trouble ticket systems are frequently used to organize the workflow of the fault resolution process. In this context, we introduce DisCaRia, a distributed case-based reasoning system that assists system administrators and network operators in resolving faults. DisCaRia integrates various fault knowledge resources that are already available on the Internet, and it exploits them by applying a distributed case-based reasoning methodology based on scalable peer-to-peer technology. We present the architecture of DisCaRia and its key algorithms, and we evaluate a prototype implementation of the system.
  • Keywords
    case-based reasoning; peer-to-peer computing; program debugging; DisCaRia; communication networks; distributed case-based reasoning system; distributed systems; event correlation systems; fault management; fault resolution; monitoring systems; scalable peer-to-peer technology; trouble ticket systems; Communication networks; Communication system operations and management; Fault diagnosis; Monitoring; Peer-to-peer computing; Fault resolution; bug tracking system; case-based reasoning; fault management; peer-to-peer; software bug search;
  • fLanguage
    English
  • Journal_Title
    Network and Service Management, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1932-4537
  • Type
    jour
  • DOI
    10.1109/TNSM.2015.2496224
  • Filename
    7313011