Title :
A Framework for Semi-supervised Clustering Based on Dimensionality Reduction
Author :
Cui, Peng ; Zhang, Ru-bo
Author_Institution :
Harbin Eng. Univ., Harbin, China
Abstract :
In machine learning and pattern recognition fields, collecting labeled examples costs human efforts, while vast amounts of unlabelled data are often readily available and offer some additional information. In practice, many applications require a dimensionality reduction method to deal with the partially labeled problem. In this paper, we propose a semi-supervised clustering framework, which is based on feature projection and semi-supervised fuzzy clustering. High dimensional data is mapped into a low dimensional space with feature projection. A mapping of learning data is generalized to a new datum which is classified in the low dimensional space with constrained fuzzy clustering. With the experiment on different datasets, the results show the method has good clustering performance for handling data of high dimensionality.
Keywords :
fuzzy set theory; learning (artificial intelligence); pattern classification; pattern clustering; classification; constrained fuzzy clustering; dimensionality reduction method; feature projection; high dimensional data; labeled example; low dimensional space; machine learning; pattern recognition; semisupervised fuzzy clustering; unlabelled data; Clustering algorithms; Computer science; Costs; Data engineering; Databases; Equations; Humans; Machine learning; Pattern recognition; Random variables; dimensionality reduction; pairwise constraints; projection; semi-supervised clustering;
Conference_Titel :
Database Technology and Applications, 2009 First International Workshop on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3604-0
DOI :
10.1109/DBTA.2009.107