• DocumentCode
    424195
  • Title

    SVM and reduction-based two algorithms for examining and eliminating mistakes in inconsistent examples

  • Author

    Feng, Hong-Hai ; Liao, Ming Yi ; Chen, Guo-Shun ; Yang, Bing-ru ; Chen, Yu-Mei

  • Author_Institution
    Urban & Rural Constr. Sch., Hebei Agric. Univ., Baoding, China
  • Volume
    4
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    2189
  • Abstract
    Currently, misleading findings during data mining are mostly caused by the dirty data including that caused by mistakes of inconsistent examples. A reduction-based algorithm for helping us to check the mistakes hidden in inconsistent examples is presented, i.e., firstly, through reduction to eliminate the attributes which have no relation to the inconsistencies and leave behind the ones which have some relations to the inconsistencies; secondly, to examine whether there are some mistakes of the core attribute values of the inconsistent examples in the reduction; finally, to examine the non-core attribute values of the inconsistent examples. When the amount of the inconsistent examples is large, the algorithm is especially necessary. In addition, a SVM based algorithm is presented for eliminating the inconsistent examples in which there are mistakes. Using SVM, the consistent examples are trained first, the inconsistent examples are tested, the inconsistent examples that fall into the test class are left behind, and the inconsistent examples that do not belong to the test class are eliminated.
  • Keywords
    data mining; rough set theory; support vector machines; data mining; reduction-based algorithms; rough set theory; support vector machines; Agricultural engineering; Chemical engineering; Chemical technology; Data engineering; Economic forecasting; Electronic mail; Finance; Optical fiber devices; Support vector machines; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2004.1382161
  • Filename
    1382161