• DocumentCode
    841583
  • Title
    Robust and Efficient Rule Extraction Through Data Summarization and Its Application in Welding Fault Diagnosis
  • Author
    Gong, Rongsheng; Huang, Samuel H.; Chen, Tieming
  • Author_Institution
    Dept. of Mech., Ind., & Nucl. Eng., Cincinnati Univ., Cincinnati, OH
  • Volume
    4
  • Issue
    3
  • fYear
    2008
  • Firstpage
    198
  • Lastpage
    206
  • Abstract
    This paper presents a robust and efficient method to discover knowledge for classification problems through data summarization. It discretizes continuous features and then summarizes the data using a contingency table. The inconsistency rate for different subsets of features can then be easily calculated from the contingency table, and sequential search is used to find the best feature subset. After the number of features is reduced to a certain extent, easy-to-understand knowledge can be intuitively derived from the data summary. Another desirable feature of the proposed method is its capability to learn incrementally; namely, knowledge can be updated quickly whenever new data are obtained. Moreover, the proposed method is capable of handling missing values when used for prediction. The method is applied to two benchmark data sets, showing its effectiveness in selecting discriminative features. The practical usefulness of this method in manufacturing is demonstrated through an application to welding fault diagnosis.
  • Keywords
    data mining; fault diagnosis; pattern classification; production engineering computing; welding; contingency table; data summarization; discretizes continuous features; discriminative features; knowledge discovery; rule extraction; sequential search; welding fault diagnosis; feature selection; incremental learning
  • fLanguage
    English
  • Journal_Title
    Industrial Informatics, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1551-3203
  • Type
    jour
  • DOI
    10.1109/TII.2008.2002920
  • Filename
    4604676