DocumentCode :
3416546
Title :
Weighted Clustering Approach Based on Rough Set Similarity Model
Author :
Zhang, Qizhong ; Xi, Xugang
Author_Institution :
Coll. of Autom., Hangzhou Dianzi Univ., Hangzhou, China
Volume :
1
fYear :
2010
fDate :
23-24 Oct. 2010
Firstpage :
138
Lastpage :
142
Abstract :
In the process of clustering analysis, property characteristics are not equally important in the clustering process, and some characteristics are even redundant. Improper selection of characteristics may worsen effects of several classification methods. Therefore, correct selection of characteristics of great importance for clustering is crucial in improving clustering effects. Based on this, this paper proposes a new weighted clustering approach based on rough set similarity models, as well as a class purification approach based on information theory. First, weights of each property characteristic are initialized to 1, form primary equivalent classes with rough set similarity models, and then evaluate these property characteristics according to the information entropy theory, repeated weighted clustering are performed on each property with their entropy values, and finally obtain pattern classes satisfying clustering requirements. Tested by dataset from UCI, this algorithm is found to have obvious improvements in both classification rates and purity of each pattern class.
Keywords :
entropy; pattern clustering; rough set theory; class purification approach; information entropy theory; rough set similarity model; weighted clustering approach; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Information entropy; Information systems; Machine learning algorithms; Information Entropy; Rough Set Similarity Models; Weighted Clustering Approach;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-8432-4
Type :
conf
DOI :
10.1109/AICI.2010.36
Filename :
5656585
Link To Document :
بازگشت