• DocumentCode
    67245
  • Title

    The Supervised Hierarchical Dirichlet Process

  • Author

    Dai, Andrew M. ; Storkey, Amos J.

  • Author_Institution
    Google Inc., Mountain View, CA, USA
  • Volume
    37
  • Issue
    2
  • fYear
    2015
  • fDate
    Feb. 1 2015
  • Firstpage
    243
  • Lastpage
    255
  • Abstract
    We propose the supervised hierarchical Dirichlet process (sHDP), a nonparametric generative model for the joint distribution of a group of observations and a response variable directly associated with that whole group. We compare the sHDP with another leading method for regression on grouped data, the supervised latent Dirichlet allocation (sLDA) model. We evaluate our method on two real-world classification problems and two real-world regression problems. Bayesian nonparametric regression models based on the Dirichlet process, such as the Dirichlet process-generalised linear models (DP-GLM) have previously been explored; these models allow flexibility in modelling nonlinear relationships. However, until now, hierarchical Dirichlet process (HDP) mixtures have not seen significant use in supervised problems with grouped data since a straightforward application of the HDP on the grouped data results in learnt clusters that are not predictive of the responses. The sHDP solves this problem by allowing for clusters to be learnt jointly from the group structure and from the label assigned to each group.
  • Keywords
    Bayes methods; pattern classification; regression analysis; stochastic processes; Bayesian nonparametric regression models; DP-GLM; Dirichlet process-generalised linear models; classification problems; group structure; nonparametric generative model; observation group joint distribution; response variable; sHDP; sLDA model; supervised hierarchical Dirichlet process; supervised latent Dirichlet allocation model; Adaptation models; Analytical models; Bayes methods; Data models; Predictive models; Resource management; Vocabulary; Bayesian nonparametrics; hierarchical Dirichlet process; latent Dirichlet allocation; topic modelling;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2014.2315802
  • Filename
    6784083