• DocumentCode
    2261336
  • Title

    Mining Hot Topics from Free-Text Customer Reviews An LDA-Based Approach

  • Author

    Yu, Chuanming ; Zhang, Xiaoqing ; Luo, Huiting

  • Author_Institution
    Sch. of Inf. & Security Eng., Zhongnan Univ. of Econ. & Law, Wuhan, China
  • fYear
    2010
  • fDate
    20-22 Aug. 2010
  • Firstpage
    85
  • Lastpage
    89
  • Abstract
    This study examines how the Latent Dirichlet Allocation (LDA) model combined with natural language processing techniques can be used to identify hot topics from free-text customer reviews. To verify the validity of the proposed approach, 21 580 restaurant reviews are collected. Each review is viewed as a probabilistic mixture of latent topics and each topic is treated as a probability distribution over words in a vocabulary. Parameters are estimated with Gibbs sampling, and the hot topics with top words are acquired. The experiments show that this approach could produce satisfactory results.
  • Keywords
    customer profiles; data mining; natural language processing; probability; vocabulary; Gibbs sampling; LDA model; LDA-based approach; free-text customer reviews; hot topic mining; latent Dirichlet allocation; natural language processing techniques; parameter estimation; probabilistic mixture; probability distribution; restaurant reviews; vocabulary; Dairy products; Feature extraction; Hidden Markov models; Internet; Markov processes; Probability distribution; Vocabulary; Gibbs Sampling; Hot Topic Detection; Latent Dirichlet Allocatio; User Reviews;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Information Systems and Applications Conference (WISA), 2010 7th
  • Conference_Location
    Hohhot
  • Print_ISBN
    978-1-4244-8440-9
  • Type

    conf

  • DOI
    10.1109/WISA.2010.20
  • Filename
    5581396