Title :
Text clustering using fuzzy neighborhood and evaluation of clusters
Author :
Zhang Canlun ; Miyamoto, Sadaaki
Author_Institution :
Master Program in Risk Eng., Univ. of Tsukuba, Tsukuba, Japan
Abstract :
This study is concerned with text clustering and evaluating the clusters. Fuzzy neighborhood, a method of text mining, is used for this purpose. Five different clustering algorithms are used, they are kernel affinity propagation, kernel hard c-means, kernel fuzzy c-means, kernel hard k-means++ and kernel variable size hard c-means. These algorithms are applied to the clustering of nouns and adjectives in Chinese documents and SNS. The results from the previous four algorithms except kernel affinity propagation are evaluated by Rand index.
Keywords :
data mining; fuzzy set theory; natural language processing; pattern clustering; text analysis; Chinese documents; SNS; cluster evaluation; fuzzy neighborhood; kernel affinity propagation; kernel fuzzy c-means; kernel hard c-means; kernel hard k-means++; kernel variable size hard c-means; text clustering; text mining; Clustering algorithms; Clustering methods; Companies; Indexes; Kernel; Linear programming; Text mining; clustering; evaluation; fuzzy neighborhood; text mining;
Conference_Titel :
Granular Computing (GrC), 2014 IEEE International Conference on
Conference_Location :
Noboribetsu
DOI :
10.1109/GRC.2014.6982800