DocumentCode :
39932
Title :
Jeffreys Centroids: A Closed-Form Expression for Positive Histograms and a Guaranteed Tight Approximation for Frequency Histograms
Author :
Nielsen, Frank
Author_Institution :
Sony Comput. Sci. Labs., Inc., Tokyo, Japan
Volume :
20
Issue :
7
fYear :
2013
fDate :
Jul-13
Firstpage :
657
Lastpage :
660
Abstract :
Due to the success of the bag-of-word modeling paradigm, clustering histograms has become an important ingredient of modern information processing. Clustering histograms can be performed using the celebrated k-means centroid-based algorithm. From the viewpoint of applications, it is usually required to deal with symmetric distances. In this letter, we consider the Jeffreys divergence that symmetrizes the Kullback-Leibler divergence, and investigate the computation of Jeffreys centroids. We first prove that the Jeffreys centroid can be expressed analytically using the Lambert W function for positive histograms. We then show how to obtain a fast guaranteed approximation when dealing with frequency histograms. Finally, we conclude with some remarks on the k-means histogram clustering.
Keywords :
approximation theory; document handling; pattern classification; pattern clustering; Jeffreys centroids; Kullback-Leibler divergence; Lambert W function; bag-of-word modeling paradigm; closed-form expression; document classification; frequency histograms; guaranteed tight approximation; histograms clustering; k-means centroid-based algorithm; k-means histogram clustering; positive histograms; Approximation algorithms; Approximation methods; Clustering algorithms; Databases; Histograms; Signal processing algorithms; Visualization; Centroid; Jeffreys divergence; Kullback–Leibler divergence; Lambert $W$ function; clustering; histogram;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2013.2260538
Filename :
6509930
Link To Document :
بازگشت