• DocumentCode
    3138862
  • Title

    Rough set based attribute reduction approach in data mining

  • Author

    Li, Kan ; Liu, Yu-shu

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Beijing Inst. of Technol., China
  • Volume
    1
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    60
  • Abstract
    In previous attribute reduction researches, the criteria of reduction are intended that the numbers of attributes are the least, the last rules are the simplest or amount of reduction is the most. But in database theory, the criteria are that the redundancy of attributes and dependency of attributes are as few as possible. According to these, authors propose the rough set based attribute reduction algorithm. The decision table is judged firstly whether or not it is consistent. To the complete consistent table, using the knowledge of rough set and information theory, authors get attribute reduction set by discernibility matrix, and compute relevance of attributes through conditional entropy. The best attribute reduction is the set which value is the minimum of average of attribute relevance. To the complete inconsistent table, authors make directly the decision rules with rough operator. The experiment shows it can get better effect. Reduction results of UCI databases are gotten through using the algorithm.
  • Keywords
    data mining; database theory; entropy; information theory; matrix algebra; redundancy; rough set theory; UCI databases; attribute dependency; attribute reduction set; attribute redundancy; complete inconsistent table; conditional entropy; data mining; database theory; discernibility matrix; information theory; minimum average attribute relevance; rough set based attribute reduction approach; Computer science; Data engineering; Data mining; Databases; Electronic mail; Entropy; Fuzzy set theory; Information systems; Information theory; Set theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
  • Print_ISBN
    0-7803-7508-4
  • Type

    conf

  • DOI
    10.1109/ICMLC.2002.1176709
  • Filename
    1176709