Title :
An Efficient Algorithm of Hot Events Detection in Text Streams
Author :
Bai, Junliang ; Guo, Jun ; Chen, Guang ; Xu, Weiran ; Du, Gang
Author_Institution :
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Hot events detection in text streams has drawn increasing attention in recent sequential data mining works. Different from traditional TDT task which find all the real events´ cluster, hot events detection only identify hot events concerned by public. This paper proposes a novel approach to identify those events based on burst terms, terms co-occurrence and generative probabilistic model. Experiments with huge text stream sets crawled from WWW suggest that our algorithm can work on-line and identify hot events effectively and efficiently.
Keywords :
data mining; text analysis; burst terms; efficient algorithm; generative probabilistic model; hot events detection; sequential data mining; terms cooccurrence; text streams; Algorithm design and analysis; Clustering algorithms; Computational modeling; Earthquakes; Event detection; Helium; Web sites; Algorithm; Data Mining; Hot Events Detection;
Conference_Titel :
Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2010 International Conference on
Conference_Location :
Huangshan
Print_ISBN :
978-1-4244-8434-8
Electronic_ISBN :
978-0-7695-4235-5
DOI :
10.1109/CyberC.2010.65