• DocumentCode
    1780589
  • Title

    Subspace clustering of dimensionality-reduced data

  • Author

    Heckel, Reinhard ; Tschannen, Michael ; Bolcskei, Helmut

  • Author_Institution
    ETH Zurich, Zurich, Switzerland
  • fYear
    2014
  • fDate
    June 29 2014-July 4 2014
  • Firstpage
    2997
  • Lastpage
    3001
  • Abstract
    Subspace clustering refers to the problem of clustering unlabeled high-dimensional data points into a union of low-dimensional linear subspaces, assumed unknown. In practice one may have access to dimensionality-reduced observations of the data only, resulting, e.g., from “undersampling” due to complexity and speed constraints on the acquisition device. More pertinently, even if one has access to the high-dimensional data set it is often desirable to first project the data points into a lower-dimensional space and to perform the clustering task there; this reduces storage requirements and computational cost. The purpose of this paper is to quantify the impact of dimensionality-reduction through random projection on the performance of the sparse subspace clustering (SSC) and the thresholding based subspace clustering (TSC) algorithms. We find that for both algorithms dimensionality reduction down to the order of the subspace dimensions is possible without incurring significant performance degradation. The mathematical engine behind our theorems is a result quantifying how the affinities between subspaces change under random dimensionality reducing projections.
  • Keywords
    data acquisition; data reduction; pattern clustering; storage management; SSC; TSC algorithm; acquisition device; dimensionality reducing projection; dimensionality-reduced data; dimensionality-reduced observation; dimensionality-reduction; low-dimensional linear subspaces; mathematical engine; random projection; sparse subspace clustering; speed constraint; storage requirement; thresholding based subspace clustering algorithm; unlabeled high-dimensional data points; Clustering algorithms; Data models; Engines; High definition video; Information theory; Lighting; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Theory (ISIT), 2014 IEEE International Symposium on
  • Conference_Location
    Honolulu, HI
  • Type

    conf

  • DOI
    10.1109/ISIT.2014.6875384
  • Filename
    6875384