مرکز منطقه ای اطلاع رساني علوم و فناوري - Constrained clustering with Minkowski Weighted K-Means

DocumentCode :

1930577

Title :

Constrained clustering with Minkowski Weighted K-Means

Author :

de Amorim, R.C.

Author_Institution :

Dept. of Comput. Sci. & Inf. Syst., Birkbeck, Univ. of London, London, UK

fYear :

2012

fDate :

20-22 Nov. 2012

Firstpage :

Lastpage :

Abstract :

In this paper we introduce the Constrained Minkowski Weighted K-Means. This algorithm calculates cluster specific feature weights that can be interpreted as feature rescaling factors thanks to the use of the Minkowski distance. Here, we use an small amount of labelled data to select a Minkowski exponent and to generate clustering constrains based on pair-wise must-link and cannot-link rules. We validate our new algorithm with a total of 12 datasets, most of which containing features with uniformly distributed noise. We have run the algorithm numerous times in each dataset. These experiments ratify the general superiority of using feature weighting in K-Means, particularly when applying the Minkowski distance. We have also found that the use of constrained clustering rules has little effect on the average proportion of correctly clustered entities. However, constrained clustering does improve considerably the maximum of such proportion.

Keywords :

pattern clustering; Minkowski distance; Minkowski exponent; Minkowski weighted K-means; cannot-link rules; clustering constrain; constrained clustering; feature rescaling factor; feature weighting; pair-wise must-link rules; Constrained K-Means; Minkowski Weighted K-Means; Minkowski metric; feature weighting; semi-supervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Intelligence and Informatics (CINTI), 2012 IEEE 13th International Symposium on

Conference_Location :

Budapest

Print_ISBN :

978-1-4673-5205-5

Electronic_ISBN :

978-1-4673-5210-9

Type :

conf

DOI :

10.1109/CINTI.2012.6496753

Filename :

6496753

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1930577