DocumentCode
2261336
Title
Mining Hot Topics from Free-Text Customer Reviews An LDA-Based Approach
Author
Yu, Chuanming ; Zhang, Xiaoqing ; Luo, Huiting
Author_Institution
Sch. of Inf. & Security Eng., Zhongnan Univ. of Econ. & Law, Wuhan, China
fYear
2010
fDate
20-22 Aug. 2010
Firstpage
85
Lastpage
89
Abstract
This study examines how the Latent Dirichlet Allocation (LDA) model combined with natural language processing techniques can be used to identify hot topics from free-text customer reviews. To verify the validity of the proposed approach, 21 580 restaurant reviews are collected. Each review is viewed as a probabilistic mixture of latent topics and each topic is treated as a probability distribution over words in a vocabulary. Parameters are estimated with Gibbs sampling, and the hot topics with top words are acquired. The experiments show that this approach could produce satisfactory results.
Keywords
customer profiles; data mining; natural language processing; probability; vocabulary; Gibbs sampling; LDA model; LDA-based approach; free-text customer reviews; hot topic mining; latent Dirichlet allocation; natural language processing techniques; parameter estimation; probabilistic mixture; probability distribution; restaurant reviews; vocabulary; Dairy products; Feature extraction; Hidden Markov models; Internet; Markov processes; Probability distribution; Vocabulary; Gibbs Sampling; Hot Topic Detection; Latent Dirichlet Allocatio; User Reviews;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Information Systems and Applications Conference (WISA), 2010 7th
Conference_Location
Hohhot
Print_ISBN
978-1-4244-8440-9
Type
conf
DOI
10.1109/WISA.2010.20
Filename
5581396
Link To Document