• DocumentCode
    1504101
  • Title

    Redundancy detection in semistructured case bases

  • Author

    Racine, Kirsti ; Yang, Qiang

  • Author_Institution
    IBM Canada Ltd., Toronto, Ont., Canada
  • Volume
    13
  • Issue
    3
  • fYear
    2001
  • Firstpage
    513
  • Lastpage
    518
  • Abstract
    With the dramatic proliferation of case-based reasoning systems in commercial applications, many case bases are now becoming legacy systems. They represent a significant portion of an organization´s assets, but they are large and difficult to maintain. One of the contributing factors is that these case bases are often large and yet unstructured or semistructured; they are represented in natural language text. Adding to the complexity is the fact that the case bases are often authored and updated by different people from a variety of knowledge sources, making it highly likely for a case base to contain redundant and inconsistent knowledge. We present methods and a system for maintaining large and semistructured case bases. We focus on a difficult problem in case base maintenance: redundancy detection. This problem is particularly pervasive when one deals with a semistructured case base. We discuss an information retrieval-based algorithm and an implemented system for solving this problem. As the ability to contain the knowledge acquisition problem is of paramount importance, our method allows one to express relevant domain expertise for detecting redundancy naturally and effortlessly. Empirical evaluations of the system demonstrate the effectiveness of the methods in several large domains
  • Keywords
    case-based reasoning; information retrieval; knowledge acquisition; knowledge based systems; knowledge verification; case base maintenance; case-based reasoning; data mining; inconsistent knowledge; information retrieval; knowledge acquisition; knowledge verification; legacy systems; natural language text; organization; redundancy detection; redundant knowledge; semistructured case bases; Computer aided software engineering; Expert systems; Knowledge acquisition; Mining industry; Natural languages; Printers; Problem-solving; Raw materials; Redundancy;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.929905
  • Filename
    929905