Title :
Improving performance of similarity-based clustering by feature weight learning
Author :
Yeung, D.S. ; Wang, X.Z.
Author_Institution :
Dept. of Comput., Hong Kong Polytech. Univ., Kowloon, China
fDate :
4/1/2002 12:00:00 AM
Abstract :
Similarity-based clustering is a simple but powerful technique which usually results in a clustering graph for a partitioning of threshold values in the unit interval. The guiding principle of similarity-based clustering is "similar objects are grouped in the same cluster." To judge whether two objects are similar, a similarity measure must be given in advance. The similarity measure presented in the paper is determined in terms of the weighted distance between the features of the objects. Thus, the clustering graph and its performance (which is described by several evaluation indices defined in the paper) will depend on the feature weights. The paper shows that, by using gradient descent technique to learn the feature weights, the clustering performance can be significantly improved. It is also shown that our method helps to reduce the uncertainty (fuzziness and nonspecificity) of the similarity matrix. This enhances the quality of the similarity-based decision making
Keywords :
fuzzy set theory; gradient methods; graph theory; matrix algebra; pattern clustering; clustering graph; feature weight learning; fuzziness; gradient descent technique; nonspecificity; partitioning; similar objects; similarity matrix; similarity-based clustering; similarity-based decision making; threshold values; uncertainty; weighted distance; Decision making; Uncertainty;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on