• DocumentCode
    1762545
  • Title

    Entropy-Rate Clustering: Cluster Analysis via Maximizing a Submodular Function Subject to a Matroid Constraint

  • Author

    Ming-Yu Liu ; Tuzel, Oncel ; Ramalingam, S. ; Chellappa, Rama

  • Author_Institution
    Mitsubishi Electr. Res. Labs. (MERL), Mitsubishi Electr. Corp., Cambridge, MA, USA
  • Volume
    36
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    99
  • Lastpage
    112
  • Abstract
    We propose a new objective function for clustering. This objective function consists of two components: the entropy rate of a random walk on a graph and a balancing term. The entropy rate favors formation of compact and homogeneous clusters, while the balancing function encourages clusters with similar sizes and penalizes larger clusters that aggressively group samples. We present a novel graph construction for the graph associated with the data and show that this construction induces a matroid--a combinatorial structure that generalizes the concept of linear independence in vector spaces. The clustering result is given by the graph topology that maximizes the objective function under the matroid constraint. By exploiting the submodular and monotonic properties of the objective function, we develop an efficient greedy algorithm. Furthermore, we prove an approximation bound of 1/2 for the optimality of the greedy solution. We validate the proposed algorithm on various benchmarks and show its competitive performances with respect to popular clustering algorithms. We further apply it for the task of superpixel segmentation. Experiments on the Berkeley segmentation data set reveal its superior performances over the state-of-the-art superpixel segmentation algorithms in all the standard evaluation metrics.
  • Keywords
    entropy; graph theory; greedy algorithms; image segmentation; pattern clustering; Berkeley segmentation data set; approximation bound; balancing function; balancing term; cluster analysis; clustering algorithms; combinatorial structure; compact clusters; entropy-rate clustering; graph construction; graph topology; greedy algorithm; greedy solution; homogeneous clusters; linear independence; matroid constraint; monotonic properties; random walk; standard evaluation metrics; submodular function; submodular properties; superpixel segmentation; vector spaces; Algorithm design and analysis; Clustering algorithms; Entropy; Image segmentation; Linear programming; Topology; Uncertainty; Clustering; discrete optimization; graph theory; information theory; submodular function; superpixel segmentation;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2013.107
  • Filename
    6529077