DocumentCode
2070086
Title
Maximizing the reliability of two-state automaton for burst feature detection in news streams
Author
Du, Gang ; Jun Guo ; Xu, Wei Ran ; Yang, Zhen
Author_Institution
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
Volume
1
fYear
2010
fDate
10-12 Dec. 2010
Firstpage
229
Lastpage
233
Abstract
The capture of temporal dynamics of news streams has drawn increasing attentions in recent sequential data mining works. Most of them are based on the intuition that a “burst” of a topic is signaled by a growth of relevant words in a high intensity during a period of time. Such “burst features” can be efficiently identified by Kleinberg´s two-state automaton model. The resolution is an important parameter of the model. It affects the reliability of the results greatly. This paper maximizes the reliability of the results by estimating adaptive resolution for each word with EM algorithm. Experiments with the public news corpora prove that the unified resolution is a bottleneck of the performance, and the results with word-adaptive resolutions approximate to the maximum reliability well.
Keywords
automata theory; data mining; expectation-maximisation algorithm; information resources; text analysis; EM algorithm; burst feature detection; maximum reliability; news streams; sequential data mining works; temporal dynamics; two-state automaton; Analytical models; Bridges; Cyclones; Mouth; Variable speed drives; EM algorithm; automaton; burst feature detection; temporal data mining; text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Progress in Informatics and Computing (PIC), 2010 IEEE International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-6788-4
Type
conf
DOI
10.1109/PIC.2010.5687459
Filename
5687459
Link To Document