Title of article :
Dynamic clustering of histogram data based on adaptive squared Wasserstein distances
Author/Authors :
Antonio Irpino، نويسنده , , Antonio and Verde، نويسنده , , Rosanna and De Carvalho، نويسنده , , Francisco de A.T.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2014
Pages :
16
From page :
3351
To page :
3366
Abstract :
This paper presents a Dynamic Clustering Algorithm for histogram data with an automatic weighting step of the variables by using adaptive distances. The Dynamic Clustering Algorithm is a k-means-like algorithm for clustering a set of objects into a predefined number of classes. Histogram data are realizations of particular set-valued descriptors defined in the context of Symbolic Data Analysis. We propose to use the ℓ 2 Wasserstein distance for clustering histogram data and two novel adaptive distance based clustering schemes. The ℓ 2 Wasserstein distance allows to express the variability of a set of histograms in two components: the first related to the variability of their averages and the second to the variability of the histograms related to different size and shape. The weighting step aims to take into account global and local adaptive distances as well as two components of the variability of a set of histograms. To evaluate the clustering results, we extend some classic partition quality indexes when the proposed adaptive distances are used in the clustering criterion function. Examples on synthetic and real-world datasets corroborate the proposed clustering procedure.
Keywords :
Histogram data , Partitioning clustering method , Wasserstein distance , Symbolic data analysis , Adaptive distance
Journal title :
Expert Systems with Applications
Serial Year :
2014
Journal title :
Expert Systems with Applications
Record number :
2354654
Link To Document :
بازگشت