• DocumentCode
    2773107
  • Title

    Clustering with Multiple Graphs

  • Author

    Tang, Wei ; Lu, Zhengdong ; Dhillon, Inderjit S.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2009
  • fDate
    6-9 Dec. 2009
  • Firstpage
    1016
  • Lastpage
    1021
  • Abstract
    In graph-based learning models, entities are often represented as vertices in an undirected graph with weighted edges describing the relationships between entities. In many real-world applications, however, entities are often associated with relations of different types and/or from different sources, which can be well captured by multiple undirected graphs over the same set of vertices. How to exploit such multiple sources of information to make better inferences on entities remains an interesting open problem. In this paper, we focus on the problem of clustering the vertices based on multiple graphs in both unsupervised and semi-supervised settings. As one of our contributions, we propose Linked Matrix Factorization (LMF) as a novel way of fusing information from multiple graph sources. In LMF, each graph is approximated by matrix factorization with a graph-specific factor and a factor common to all graphs, where the common factor provides features for all vertices. Experiments on SIAM journal data show that (1) we can improve the clustering accuracy through fusing multiple sources of information with several models, and (2) LMF yields superior or competitive results compared to other graph-based clustering methods.
  • Keywords
    graph theory; pattern clustering; unsupervised learning; graph based learning models; graph clustering; linked matrix factorization; semi-supervised learning; undirected graph; unsupervised learning; Clustering algorithms; Clustering methods; Data engineering; Data mining; Industrial relations; Inference algorithms; Information resources; Machine learning; Mathematics; Supervised learning; clustering; graph; multiple sources; semi-supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-5242-2
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2009.125
  • Filename
    5360349