DocumentCode
1282997
Title
The Burbea-Rao and Bhattacharyya Centroids
Author
Nielsen, Frank ; Boltz, Sylvain
Author_Institution
Dept. of Fundamental Res., Sony Comput. Sci. Labs., Inc., Tokyo, Japan
Volume
57
Issue
8
fYear
2011
Firstpage
5455
Lastpage
5466
Abstract
We study the centroid with respect to the class of information-theoretic Burbea-Rao divergences that generalize the celebrated Jensen-Shannon divergence by measuring the non-negative Jensen difference induced by a strictly convex and differentiable function. Although those Burbea-Rao divergences are symmetric by construction, they are not metric since they fail to satisfy the triangle inequality. We first explain how a particular symmetrization of Bregman divergences called Jensen-Bregman distances yields exactly those Burbea-Rao divergences. We then proceed by defining skew Burbea-Rao divergences, and show that skew Burbea-Rao divergences amount in limit cases to compute Bregman divergences. We then prove that Burbea-Rao centroids can be arbitrarily finely approximated by a generic iterative concave-convex optimization algorithm with guaranteed convergence property. In the second part of the paper, we consider the Bhattacharyya distance that is commonly used to measure overlapping degree of probability distributions. We show that Bhattacharyya distances on members of the same statistical exponential family amount to calculate a Burbea-Rao divergence in disguise. Thus we get an efficient algorithm for computing the Bhattacharyya centroid of a set of parametric distributions belonging to the same exponential families, improving over former specialized methods found in the literature that were limited to univariate or “diagonal” multivariate Gaussians. To illustrate the performance of our Bhattacharyya/Burbea-Rao centroid algorithm, we present experimental performance results for k-means and hierarchical clustering methods of Gaussian mixture models.
Keywords
Gaussian distribution; information theory; iterative methods; optimisation; probability; Bhattacharyya centroids; Bhattacharyya distance; Bregman divergences; Burbea-Rao centroids; Gaussian mixture; Jensen-Shannon divergence; convergence property; diagonal multivariate Gaussians; former specialized methods; generic iterative concave-convex optimization; hierarchical clustering; information-theoretic Burbea-Rao divergences; k-means clustering; nonnegative Jensen difference; parametric distributions; probability distributions; triangle inequality; Computer science; Convex functions; Entropy; Generators; Measurement; Optimized production technology; Bhattacharrya divergence; Bregman divergences; Burbea-Rao divergence; Jensen-Shannon divergence; Kullback-Leibler divergence; centroid; exponential families; information geometry;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/TIT.2011.2159046
Filename
5961839
Link To Document