• DocumentCode
    463992
  • Title

    Unsupervised Locally Embedded Clustering for Automatic High-Dimensional Data Labeling

  • Author

    Yun Fu ; Huang, Thomas S.

  • Author_Institution
    Beckman Inst. for Adv. Sci. & Technol., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • Volume
    3
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    In most machine learning and pattern recognition problems, the large number of high-dimensional sensory data, such as images and videos, are often labeled manually for training classifiers and modeling features, which is time-consuming and tedious. To automatically execute this process by machine, we present the unsupervised high-dimensional data clustering and automatic labeling algorithms, called locally embedded clustering (LEC): (i) constructing the neighborhood weighted graph with an appropriate distance metric; (ii) tuning the regularization parameter to smooth the approximated manifold; (iii) calculating the unified projection in a closed-form solution for the embedding and dimensionality reduction; (iv) choosing the top or bottom coordinates of the embedded low-dimensional space for data representation; (v) normalizing the low-dimensional representation to have unit length; (vi) clustering and labeling the data via K-means. Experimental results demonstrate that LEC provides better data representation, more efficient dimensionality reduction and better clustering performance than many existing methods.
  • Keywords
    data handling; graph theory; K-means; automatic high-dimensional data labeling; automatic labeling algorithms; data representation; dimensionality reduction; high-dimensional sensory data; low-dimensional representation; machine learning problem; neighborhood weighted graph; pattern recognition problem; regularization parameter; unsupervised high-dimensional data clustering; unsupervised locally embedded clustering; Clustering algorithms; Focusing; Geometry; Labeling; Laplace equations; Machine learning; Machine learning algorithms; Manifolds; Pattern recognition; Videos; LEA; LEC; dimensionality reduction; high-dimensional data clustering; manifold;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366865
  • Filename
    4217895