Title :
Improved K-Modes Clustering Method Based on Chi-square Statistics
Author_Institution :
Coll. of Inf. Eng., NanChang Inst. of Technol., Nanchang, China
Abstract :
This paper proposes an improved K-Modes clustering method that uses Chi-square statistics to characterize the relationships between the attributes of data objects. On this basis, a new distance measure is proposed. The measure takes into account not only the difference in an object's value for a given attribute, but also the influence of the other attributes, making it better suited to practical problems. Experimental results show that the proposed clustering method is effective and can improve clustering accuracy.
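The abstract does not state the formula for the new distance measure, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes that inter-attribute association is measured with the Pearson Chi-square statistic, normalized to Cramér's V, and that each attribute's mismatch is weighted by its average association with the other attributes. The function names (`chi_square`, `cramers_v`, `attribute_weights`, `weighted_mismatch`) and the weighting rule are hypothetical choices made for illustration.

```python
from collections import Counter
from math import sqrt


def chi_square(xs, ys):
    """Pearson Chi-square statistic for two categorical columns."""
    n = len(xs)
    row, col = Counter(xs), Counter(ys)
    joint = Counter(zip(xs, ys))
    stat = 0.0
    for a, row_count in row.items():
        for b, col_count in col.items():
            expected = row_count * col_count / n
            observed = joint.get((a, b), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def cramers_v(xs, ys):
    """Chi-square normalized to [0, 1]; 0 if either column is constant."""
    k = min(len(set(xs)), len(set(ys))) - 1
    if k == 0:
        return 0.0
    return sqrt(chi_square(xs, ys) / (len(xs) * k))


def attribute_weights(rows):
    """ASSUMPTION: weight = 1 + mean association with the other attributes."""
    cols = list(zip(*rows))
    m = len(cols)
    if m == 1:
        return [1.0]
    return [
        1.0 + sum(cramers_v(cols[j], cols[k]) for k in range(m) if k != j) / (m - 1)
        for j in range(m)
    ]


def weighted_mismatch(x, mode, weights):
    """Simple-matching (K-Modes) distance with per-attribute weights."""
    return sum(w for xi, mi, w in zip(x, mode, weights) if xi != mi)


rows = [("a", "x", "p"), ("a", "x", "q"), ("b", "y", "p"), ("b", "y", "q")]
weights = attribute_weights(rows)
print(weights, weighted_mismatch(("a", "x", "p"), ("b", "x", "q"), weights))
```

In this toy data the first two attributes determine each other and the third is independent of both, so a mismatch on either of the first two counts for more than a mismatch on the third. With all weights set to 1 the distance reduces to the ordinary K-Modes simple-matching distance.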
Keywords :
data mining; pattern clustering; set theory; statistical analysis; Chi-square statistics; data object attribute; distance measure method; K-Modes clustering method; Accuracy; Breast cancer; Clustering algorithms; Clustering methods; Correlation; Mutual information
Conference_Title :
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-7964-1
DOI :
10.1109/GrC.2010.66