• DocumentCode
    1404075
  • Title

    Master Defect Record Retrieval Using Network-Based Feature Association

  • Author

    Rodriguez, Andrew ; Chaovalitwongse, W. Art ; Zhe, Liang ; Singhal, Harsh ; Pham, Hoang

  • Author_Institution
    Dept. of Ind. & Syst. Eng., Rutgers Univ., Piscataway, NJ, USA
  • Volume
    40
  • Issue
    3
  • fYear
    2010
  • fDate
    5/1/2010 12:00:00 AM
  • Firstpage
    319
  • Lastpage
    329
  • Abstract
    As electronic records (e.g., medical records and technical defect records) accumulate, the retrieval of a record from a past instance with the same or similar circumstances, has become extremely valuable. This is because a past record may contain the correct diagnosis or correct solution to the current circumstance. We refer to the two records of the same or similar circumstances as master and duplicate records. Current record retrieval techniques are lacking when applied to this special master defect record retrieval problem. In this study, we propose a new paradigm for master defect record retrieval using network-based feature association (NBFA). We train the master record retrieval process by constructing feature associations to limit the search space. The retrieval paradigm was employed and tested on a real-world large-scale defect record database from a telecommunications company. The empirical results suggest that the NBFA was able to significantly improve the performance of master record retrieval, and should be implemented in practice. This paper presents an overview of technical aspects of the master defect record retrieval problem, describes general methodologies for retrieval of master defect records, proposes a new feature association paradigm, provides performance assessments on real data from a telecommunications company, and highlights difficulties and challenges in this line of research that should be addressed in the future.
  • Keywords
    data mining; information retrieval; telecommunication industry; duplicate records; electronic records; master defect record retrieval; network-based feature association; telecommunications company; Information retrieval (IR); keyword library; keyword weighting; query; text mining; vector space model (VSM);
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1094-6977
  • Type

    jour

  • DOI
    10.1109/TSMCC.2010.2040079
  • Filename
    5406165