• DocumentCode
    70706
  • Title

    Joint Local and Global Consistency on Interdocument and Interword Relationships for Co-Clustering

  • Author

    Bing-Kun Bao ; Weiqing Min ; Teng Li ; Changsheng Xu

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Beijing, China
  • Volume
    45
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 2015
  • Firstpage
    15
  • Lastpage
    28
  • Abstract
    Co-clustering has recently received a lot of attention due to its effectiveness in simultaneously partitioning words and documents by exploiting the relationships between them. However, most of the existing co-clustering methods neglect or only partially reveal the interword and interdocument relationships. To fully utilize those relationships, the local and global consistencies on both word and document spaces need to be considered, respectively. Local consistency indicates that the label of a word/document can be predicted from its neighbors, while global consistency enforces a smoothness constraint on words/documents labels over the whole data manifold. In this paper, we propose a novel co-clustering method, called co-clustering via local and global consistency, to not only make use of the relationship between word and document, but also jointly explore the local and global consistency on both word and document spaces, respectively. The proposed method has the following characteristics: 1) the word-document relationships is modeled by following information-theoretic co-clustering (ITCC); 2) the local consistency on both interword and interdocument relationships is revealed by a local predictor; and 3) the global consistency on both interword and interdocument relationships is explored by a global smoothness regularization. All the fitting errors from these three-folds are finally integrated together to formulate an objective function, which is iteratively optimized by a convergence provable updating procedure. The extensive experiments on two benchmark document datasets validate the effectiveness of the proposed co-clustering method.
  • Keywords
    iterative methods; optimisation; pattern clustering; word processing; ITCC; document partitioning; information-theoretic coclustering; iterative optimization; objective function; smoothness regularization; word partitioning; word-document relationships; Joints; Kernel; Manifolds; Matrix decomposition; Mutual information; Random variables; Vectors; Co-clustering; information theory; local and global learning;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2014.2317514
  • Filename
    6844822