Title :
Maximum Marginal Likelihood Estimation for Nonnegative Dictionary Learning in the Gamma-Poisson Model
Author :
Dikmen, Onur ; Févotte, Cédric
Author_Institution :
LTCI, Telecom ParisTech, Paris, France
Abstract :
In this paper we describe an alternative to standard nonnegative matrix factorization (NMF) for nonnegative dictionary learning, i.e., the task of learning a dictionary with nonnegative values from nonnegative data, under the assumption of nonnegative expansion coefficients. A popular cost function used for NMF is the Kullback-Leibler divergence, which underlies a Poisson observation model. NMF can thus be considered as maximization of the joint likelihood of the dictionary and the expansion coefficients. This approach lacks optimality because the number of parameters (which include the expansion coefficients) grows with the number of observations. In this paper we describe variational Bayes and Monte-Carlo EM algorithms for optimization of the marginal likelihood, i.e., the likelihood of the dictionary where the expansion coefficients have been integrated out (given a Gamma prior). We compare the output of both maximum joint likelihood estimation (i.e., standard NMF) and maximum marginal likelihood estimation (MMLE) on real and synthetical datasets. In particular we present face reconstruction results on CBCL dataset and text retrieval results over the musiXmatch dataset, a collection of word counts in song lyrics. The MMLE approach is shown to prevent overfitting by automatically pruning out irrelevant dictionary columns, i.e., embedding automatic model order selection.
Keywords :
Bayes methods; Monte Carlo methods; image reconstruction; matrix decomposition; maximum likelihood estimation; stochastic processes; Bayes algorithms; CBCL dataset; Kullback-Leibler divergence; Monte-Carlo EM algorithms; Poisson observation model; face reconstruction; gamma-poisson model; maximum joint likelihood estimation; maximum marginal likelihood estimation; musiXmatch dataset; nonnegative dictionary learning; nonnegative expansion coefficients; nonnegative matrix factorization; real datasets; synthetical datasets; text retrieval; word counts; Approximation algorithms; Approximation methods; Data models; Dictionaries; Estimation; Joints; Matrix decomposition; Automatic relevance determination; Monte Carlo EM; model order selection; nonnegative matrix factorization; sparse coding; variational EM;
Journal_Title :
Signal Processing, IEEE Transactions on
DOI :
10.1109/TSP.2012.2207117