Title of article :
Fuzzy semi-supervised co-clustering for text documents
Author/Authors :
Yan، نويسنده , , Yang and Chen، نويسنده , , Lihui and Tjhi، نويسنده , , William-Chandra Tjhi، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2013
Pages :
16
From page :
74
To page :
89
Abstract :
In this paper we propose a new heuristic semi-supervised fuzzy co-clustering algorithm (SS-HFCR) for categorization of large web documents. In this approach, the clustering process is carried out by incorporating some prior knowledge in the form of pair-wise constraints provided by users into the fuzzy co-clustering framework. Each constraint specifies whether a pair of documents “must” or “cannot” be clustered together. Moreover, we formulate the competitive agglomeration cost function which is also able to make use of prior knowledge in the clustering process. The experimental studies on a number of large benchmark datasets demonstrate the strength and potentials of SS-HFCR in terms of accuracy, stability and efficiency, compared with some of the recent popular semi-supervised clustering approaches.
Keywords :
Must-link/cannot-link constraint , heuristic , semi-supervised learning , Fuzzy co-clustering
Journal title :
FUZZY SETS AND SYSTEMS
Serial Year :
2013
Journal title :
FUZZY SETS AND SYSTEMS
Record number :
1601641
Link To Document :
بازگشت