DocumentCode :
3324415
Title :
A text clustering algorithm based on simplified cluster hypothesis
Author :
Sun Yuan ; Guo Wenbin
Author_Institution :
Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
fYear :
2013
fDate :
23-24 Dec. 2013
Firstpage :
412
Lastpage :
415
Abstract :
How to quickly and efficiently determine the subject category from a large amount of text is becoming an important challenge in text clustering. In this paper, One-Next text clustering algorithm based on the simplified cluster hypothesis is proposed. Meanwhile, a feature vector optimization method using grading feature vector extraction method is designed. Finally, the experimental results show that this method can get a high precession and F value, and the algorithm complexity is lower than other text clustering methods.
Keywords :
computational complexity; feature extraction; optimisation; pattern clustering; text analysis; vectors; algorithm complexity; feature vector optimization method; grading feature vector extraction method; one-next text clustering algorithm; simplified cluster hypothesis; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Clustering methods; Feature extraction; Time complexity; Vectors; VSM; feature vector optimization; text clustering; text similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Instrumentation and Measurement, Sensor Network and Automation (IMSNA), 2013 2nd International Symposium on
Conference_Location :
Toronto, ON
Type :
conf
DOI :
10.1109/IMSNA.2013.6743303
Filename :
6743303
Link To Document :
بازگشت