DocumentCode :
506837
Title :
Spectral Clustering for Chinese Word
Author :
Liu, Ying ; Nan, Wang ; Zheng, Tie
Author_Institution :
Dept. of Chinese Language & Literature, Tsinghua Univ., Beijing, China
Volume :
1
fYear :
2009
fDate :
14-16 Aug. 2009
Firstpage :
529
Lastpage :
533
Abstract :
The similarity between words is used for word clustering. In spectral clustering algorithms, the information contained in the eigenvectors of an affinity matrix is used to detect the similarity. Compared with traditional clustering methods, spectral clustering performs much better for clustering the words especially in multidimensional vector spaces. the spectral clustering is implemented by Visual C++ and Matlab in the paper, which is applied to cluster small scale segmented Chinese corpus and large scale non-segmented Chinese corpus. good experimental results are observed and result analysis are given for spectral clustering.
Keywords :
C++ language; eigenvalues and eigenfunctions; matrix algebra; pattern clustering; word processing; Chinese word; Matlab; Visual C++; eigenvectors; multidimensional vector spaces; spectral clustering algorithms; Clustering algorithms; Clustering methods; Fuzzy systems; Large-scale systems; Multidimensional systems; Mutual information; Natural languages; Partitioning algorithms; Sun; Symmetric matrices; spectral clustering; word clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
Type :
conf
DOI :
10.1109/FSKD.2009.792
Filename :
5358511
Link To Document :
بازگشت