DocumentCode
70706
Title
Joint Local and Global Consistency on Interdocument and Interword Relationships for Co-Clustering
Author
Bing-Kun Bao ; Weiqing Min ; Teng Li ; Changsheng Xu
Author_Institution
Nat. Lab. of Pattern Recognition, Beijing, China
Volume
45
Issue
1
fYear
2015
fDate
Jan. 2015
Firstpage
15
Lastpage
28
Abstract
Co-clustering has recently received a lot of attention due to its effectiveness in simultaneously partitioning words and documents by exploiting the relationships between them. However, most of the existing co-clustering methods neglect or only partially reveal the interword and interdocument relationships. To fully utilize those relationships, the local and global consistencies on both word and document spaces need to be considered, respectively. Local consistency indicates that the label of a word/document can be predicted from its neighbors, while global consistency enforces a smoothness constraint on words/documents labels over the whole data manifold. In this paper, we propose a novel co-clustering method, called co-clustering via local and global consistency, to not only make use of the relationship between word and document, but also jointly explore the local and global consistency on both word and document spaces, respectively. The proposed method has the following characteristics: 1) the word-document relationships is modeled by following information-theoretic co-clustering (ITCC); 2) the local consistency on both interword and interdocument relationships is revealed by a local predictor; and 3) the global consistency on both interword and interdocument relationships is explored by a global smoothness regularization. All the fitting errors from these three-folds are finally integrated together to formulate an objective function, which is iteratively optimized by a convergence provable updating procedure. The extensive experiments on two benchmark document datasets validate the effectiveness of the proposed co-clustering method.
Keywords
iterative methods; optimisation; pattern clustering; word processing; ITCC; document partitioning; information-theoretic coclustering; iterative optimization; objective function; smoothness regularization; word partitioning; word-document relationships; Joints; Kernel; Manifolds; Matrix decomposition; Mutual information; Random variables; Vectors; Co-clustering; information theory; local and global learning;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TCYB.2014.2317514
Filename
6844822
Link To Document