Title :
Mining Hot Topics from Free-Text Customer Reviews An LDA-Based Approach
Author :
Yu, Chuanming ; Zhang, Xiaoqing ; Luo, Huiting
Author_Institution :
Sch. of Inf. & Security Eng., Zhongnan Univ. of Econ. & Law, Wuhan, China
Abstract :
This study examines how the Latent Dirichlet Allocation (LDA) model combined with natural language processing techniques can be used to identify hot topics from free-text customer reviews. To verify the validity of the proposed approach, 21 580 restaurant reviews are collected. Each review is viewed as a probabilistic mixture of latent topics and each topic is treated as a probability distribution over words in a vocabulary. Parameters are estimated with Gibbs sampling, and the hot topics with top words are acquired. The experiments show that this approach could produce satisfactory results.
Keywords :
customer profiles; data mining; natural language processing; probability; vocabulary; Gibbs sampling; LDA model; LDA-based approach; free-text customer reviews; hot topic mining; latent Dirichlet allocation; natural language processing techniques; parameter estimation; probabilistic mixture; probability distribution; restaurant reviews; vocabulary; Dairy products; Feature extraction; Hidden Markov models; Internet; Markov processes; Probability distribution; Vocabulary; Gibbs Sampling; Hot Topic Detection; Latent Dirichlet Allocatio; User Reviews;
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2010 7th
Conference_Location :
Hohhot
Print_ISBN :
978-1-4244-8440-9
DOI :
10.1109/WISA.2010.20