DocumentCode :
463992
Title :
Unsupervised Locally Embedded Clustering for Automatic High-Dimensional Data Labeling
Author :
Yun Fu ; Huang, Thomas S.
Author_Institution :
Beckman Inst. for Adv. Sci. & Technol., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Volume :
3
fYear :
2007
fDate :
15-20 April 2007
Abstract :
In most machine learning and pattern recognition problems, the large number of high-dimensional sensory data, such as images and videos, are often labeled manually for training classifiers and modeling features, which is time-consuming and tedious. To automatically execute this process by machine, we present the unsupervised high-dimensional data clustering and automatic labeling algorithms, called locally embedded clustering (LEC): (i) constructing the neighborhood weighted graph with an appropriate distance metric; (ii) tuning the regularization parameter to smooth the approximated manifold; (iii) calculating the unified projection in a closed-form solution for the embedding and dimensionality reduction; (iv) choosing the top or bottom coordinates of the embedded low-dimensional space for data representation; (v) normalizing the low-dimensional representation to have unit length; (vi) clustering and labeling the data via K-means. Experimental results demonstrate that LEC provides better data representation, more efficient dimensionality reduction and better clustering performance than many existing methods.
Keywords :
data handling; graph theory; K-means; automatic high-dimensional data labeling; automatic labeling algorithms; data representation; dimensionality reduction; high-dimensional sensory data; low-dimensional representation; machine learning problem; neighborhood weighted graph; pattern recognition problem; regularization parameter; unsupervised high-dimensional data clustering; unsupervised locally embedded clustering; Clustering algorithms; Focusing; Geometry; Labeling; Laplace equations; Machine learning; Machine learning algorithms; Manifolds; Pattern recognition; Videos; LEA; LEC; dimensionality reduction; high-dimensional data clustering; manifold;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366865
Filename :
4217895
Link To Document :
بازگشت