• DocumentCode
    1553827
  • Title

    Maximum Marginal Likelihood Estimation for Nonnegative Dictionary Learning in the Gamma-Poisson Model

  • Author

    Dikmen, Onur ; Févotte, Cédric

  • Author_Institution
    LTCI, Telecom ParisTech, Paris, France
  • Volume
    60
  • Issue
    10
  • fYear
    2012
  • Firstpage
    5163
  • Lastpage
    5175
  • Abstract
    In this paper we describe an alternative to standard nonnegative matrix factorization (NMF) for nonnegative dictionary learning, i.e., the task of learning a dictionary with nonnegative values from nonnegative data, under the assumption of nonnegative expansion coefficients. A popular cost function used for NMF is the Kullback-Leibler divergence, which underlies a Poisson observation model. NMF can thus be considered as maximization of the joint likelihood of the dictionary and the expansion coefficients. This approach lacks optimality because the number of parameters (which include the expansion coefficients) grows with the number of observations. In this paper we describe variational Bayes and Monte-Carlo EM algorithms for optimization of the marginal likelihood, i.e., the likelihood of the dictionary where the expansion coefficients have been integrated out (given a Gamma prior). We compare the output of both maximum joint likelihood estimation (i.e., standard NMF) and maximum marginal likelihood estimation (MMLE) on real and synthetical datasets. In particular we present face reconstruction results on CBCL dataset and text retrieval results over the musiXmatch dataset, a collection of word counts in song lyrics. The MMLE approach is shown to prevent overfitting by automatically pruning out irrelevant dictionary columns, i.e., embedding automatic model order selection.
  • Keywords
    Bayes methods; Monte Carlo methods; image reconstruction; matrix decomposition; maximum likelihood estimation; stochastic processes; Bayes algorithms; CBCL dataset; Kullback-Leibler divergence; Monte-Carlo EM algorithms; Poisson observation model; face reconstruction; gamma-poisson model; maximum joint likelihood estimation; maximum marginal likelihood estimation; musiXmatch dataset; nonnegative dictionary learning; nonnegative expansion coefficients; nonnegative matrix factorization; real datasets; synthetical datasets; text retrieval; word counts; Approximation algorithms; Approximation methods; Data models; Dictionaries; Estimation; Joints; Matrix decomposition; Automatic relevance determination; Monte Carlo EM; model order selection; nonnegative matrix factorization; sparse coding; variational EM;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/TSP.2012.2207117
  • Filename
    6232464