Title :
Design and implementation of Chinese words clustering based on atomic-concepts
Author :
Gao, Feng ; Li, Lei
Author_Institution :
Comput. Coll., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Cluster analysis is an important technique in statistics; it is an effective way to find hidden knowledge behind huge amounts of data. Especially now, in one hand, the amount of all kinds of papers is too huge to read; in the other hand, the extensive use of search engine technology also provides a huge number of words to find all kinds of information, and thus how to find the information from these words using words cluster become a meaningful issue. The method of words cluster in this paper is based on a certain improvement to the previous methods, which is the idea of putting forward atomic concepts, by calculating the similarity between the source words and the atomic concepts and weighted to obtain the clustering space. Then we use the FCM cluster algorithm of Matlab to calculate and achieve relatively good results.
Keywords :
pattern clustering; statistical analysis; word processing; Chinese words clustering; FCM cluster algorithm; Matlab; atomic-concepts; cluster analysis; fuzzy c-means; search engine technology; Artificial intelligence; Artificial neural networks; Clustering algorithms; Frequency modulation; Pediatrics; Artificial Intelligence; Atomic-concepts; Atomic-words[1]; Words cluster;
Conference_Titel :
Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
DOI :
10.1109/NLPKE.2011.6138215