DocumentCode :
2308214
Title :
A Robust Algorithm for Fuzzy Document Clustering
Author :
Chen, Lifei ; Wang, Shengrui ; Jiang, Qingshan
Author_Institution :
Sch. of Math. & Comput. Sci., Fujian Normal Univ., Fuzhou
fYear :
2009
fDate :
26-29 May 2009
Firstpage :
679
Lastpage :
684
Abstract :
In many applications of document clustering, a document may include multiple topics and thus may relate to multiple categories at the same time. Most of the existing subspace clustering algorithms can only perform hard clustering on document collections. In this paper, a fuzzy algorithm named R-FPC is introduced for document clustering. The algorithm discovers soft partitions of a data set in the soft subspaces of the data space. Using the proposed R-Greedy initialization method, R-FPC can always generate stable clustering results with competitive accuracy. The experiments are conducted on some widely used corpuses and the results have shown effectiveness and robustness of the proposed methods.
Keywords :
data analysis; document handling; fuzzy set theory; greedy algorithms; pattern clustering; R-Greedy initialization method; fuzzy document clustering; robust algorithm; Algorithm design and analysis; Application software; Clustering algorithms; Computer science; Flexible printed circuits; Iterative algorithms; Mathematics; Partitioning algorithms; Robustness; Software algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Information Networking and Applications Workshops, 2009. WAINA '09. International Conference on
Conference_Location :
Bradford
Print_ISBN :
978-1-4244-3999-7
Electronic_ISBN :
978-0-7695-3639-2
Type :
conf
DOI :
10.1109/WAINA.2009.15
Filename :
5136727
Link To Document :
بازگشت