DocumentCode
3270208
Title
Interactive mining topic evolutionary patterns from internet forums
Author
Zhou, Bin ; Kai, Cui ; Jia, Yan ; Li, Jing
Author_Institution
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
Volume
5
fYear
2010
fDate
22-24 June 2010
Abstract
In many real-world topic detection tasks, the process of the topic detection is often interactive, which means the users are likely to interfere the reason process by expressing their preferences. We proposed an algorithm, iOLDA, and the software framework for interactive topic evolution pattern detection based on Latent Dirichlet Allocation (LDA). To abate those topics not interested or related, it allows the users to add supervised information by adjusting the posterior topic-word distributions at the end of each iteration, which may influence the inference process of the next iteration. Experiments are conducted both on English and Chinese corpus and the results show that the extracted topics capture meaningful themes in the data, and the proposed interaction policies can help to discover better topics.
Keywords
Internet; data mining; iterative methods; natural language processing; user interfaces; Chinese corpus; English corpus; Internet; Latent Dirichlet Allocation; iOLDA; interactive mining; interactive topic evolution pattern; posterior topic-word distribution; topic detection; Clustering algorithms; Communication effectiveness; Computer science education; Data mining; Discussion forums; Educational technology; Electronic mail; Inference algorithms; Linear discriminant analysis; Software algorithms; data mining; probabilistic topic models; topic evolutionary patterns;
fLanguage
English
Publisher
ieee
Conference_Titel
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-6367-1
Type
conf
DOI
10.1109/ICETC.2010.5530005
Filename
5530005
Link To Document