• DocumentCode
    1312900
  • Title

    Probabilistic Topic Models

  • Author

    Blei, David ; Carin, Lawrence ; Dunson, David

  • Author_Institution
    Comput. Sci., Princeton Univ., Princeton, NJ, USA
  • Volume
    27
  • Issue
    6
  • fYear
    2010
  • Firstpage
    55
  • Lastpage
    65
  • Abstract
    In this article, we review probabilistic topic models: graphical models that can be used to summarize a large collection of documents with a smaller number of distributions over words. Those distributions are called "topics" because, when fit to data, they capture the salient themes that run through the collection. We describe both finite-dimensional parametric topic models and their Bayesian nonparametric counterparts, which are based on the hierarchical Dirichlet process (HDP). We discuss two extensions of topic models to time-series data-one that lets the topics slowly change over time and one that lets the assumed prevalence of the topics change. Finally, we illustrate the application of topic models to nontext data, summarizing some recent research results in image analysis.
  • Keywords
    Bayes methods; document image processing; probability; time series; Bayesian nonparametric counterparts; finite-dimensional parametric topic models; graphical model; hierarchical Dirichlet process; image analysis; probabilistic topic model; time-series; Analytical models; Bayesian methods; Computational modeling; Data models; Graphical models; Markov processes;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1053-5888
  • Type

    jour

  • DOI
    10.1109/MSP.2010.938079
  • Filename
    5563111