• DocumentCode
    2772375
  • Title

    Bayesian Overlapping Subspace Clustering

  • Author

    Fu, Qiang ; Banerjee, Arindam

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Minnesota, Twin Cities, MN, USA
  • fYear
    2009
  • fDate
    6-9 Dec. 2009
  • Firstpage
    776
  • Lastpage
    781
  • Abstract
    Given a data matrix, the problem of finding dense/uniform sub-blocks in the matrix is becoming important in several applications. The problem is inherently combinatorial since the uniform sub-blocks may involve arbitrary subsets of rows and columns and may even be overlapping. While there are a few existing methods based on co-clustering or subspace clustering, they typically rely on local search heuristics and in general do not have a systematic model for such data. We present a Bayesian Overlapping Subspace Clustering (BOSC) model which is a hierarchical generative model for matrices with potentially overlapping uniform sub-block structures. The BOSC model can also handle matrices with missing entries. We propose an EM-style algorithm based on approximate inference using Gibbs sampling and parameter estimation using coordinate descent for the BOSC model. Through experiments on both simulated and real datasets, we demonstrate that the proposed algorithm outperforms the state-of-the-art.
  • Keywords
    data mining; matrix algebra; parameter estimation; pattern clustering; Bayesian overlapping subspace clustering; EM-style algorithm; Gibbs sampling; co-clustering; data matrix; local search heuristics; parameter estimation; Background noise; Bayesian methods; Cities and towns; Clustering algorithms; Computer science; Data engineering; Gene expression; Inference algorithms; Parameter estimation; Sparse matrices;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-5242-2
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2009.132
  • Filename
    5360310