DocumentCode :
2545936
Title :
Clustering Algorithm on Block Division of Documents
Author :
Liu, Gang ; Luo, Mingyue
Author_Institution :
Sch. of Electron. & Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2010
fDate :
23-25 Sept. 2010
Firstpage :
1
Lastpage :
4
Abstract :
In the traditional K-means algorithm, the selection of cluster number and the initial cluster center brings huge affection on the quality of clustering. To reduce the dependence on the initial center and to locate the types of new data rapidly, an algorithm applicable for text data is proposed. In this algorithm, document density is considered as parameter. Documents are divided into blocks first. After that every divided block is clustered separately. Experiment shows that this algorithm not only makes higher quality for clustering, but also does well in the new increasing data.
Keywords :
document handling; pattern clustering; K-means algorithm; clustering algorithm; clustering quality; document block division; document density; Algorithm design and analysis; Clustering algorithms; Computational modeling; Fluctuations; Internet; Partitioning algorithms; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Wireless Communications Networking and Mobile Computing (WiCOM), 2010 6th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-3708-5
Electronic_ISBN :
978-1-4244-3709-2
Type :
conf
DOI :
10.1109/WICOM.2010.5600166
Filename :
5600166
Link To Document :
بازگشت