• DocumentCode
    507648
  • Title

    A Global Discretization and Attribute Reduction Algorithm Based on K-Means Clustering and Rough Sets Theory

  • Author

    Min, Hu

  • Author_Institution
    Sch. of Manage. & Econ., Beijing Inst. of Technol., Beijing, China
  • Volume
    2
  • fYear
    2009
  • fDate
    Nov. 30 2009-Dec. 1 2009
  • Firstpage
    92
  • Lastpage
    95
  • Abstract
    The knowledge reduction function of rough sets theory is specific on discrete data, while most attributes of decision tables are continuous. Therefore a global discretization and attribute reduction algorithm is proposed based on clustering and rough sets theory. After comparing different discretization methods, the k-means clustering algorithm is used. In order to avoid the shortcomings of k-means clustering algorithm, the F-analysis of variance statistics and support strength of condition attributes are introduced to control the discretization effectiveness. A rational clustering number is derived according to the dependency index to meet the prerequisite of the rough set theory. After that, the attributes are reduced by using rough set theory, and decision rules are induced. Lastly an example is proposed to illustrate the feasibility and effectiveness of the algorithm.
  • Keywords
    data reduction; decision tables; fuzzy set theory; rough set theory; statistical analysis; F-analysis; attribute reduction algorithm; decision rules; decision tables; dependency index; discrete data; global discretization algorithm; k-means clustering number; knowledge reduction function; rough sets theory; variance statistics; Clustering algorithms; Databases; Frequency conversion; Information systems; Knowledge acquisition; Knowledge management; Rough sets; Set theory; Statistics; Technology management; Attribute reduction; Decision table; Discretization; K-means clustering; Rough set;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge Acquisition and Modeling, 2009. KAM '09. Second International Symposium on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3888-4
  • Type

    conf

  • DOI
    10.1109/KAM.2009.16
  • Filename
    5362285