DocumentCode :
812110
Title :
Discovering Event Evolution Graphs From News Corpora
Author :
Yang, Christopher C. ; Shi, Xiaodong ; Wei, Chih-Ping
Author_Institution :
Coll. of Inf. Sci. & Technol., Drexel Univ., Philadelphia, PA
Volume :
39
Issue :
4
fYear :
2009
fDate :
7/1/2009 12:00:00 AM
Firstpage :
850
Lastpage :
863
Abstract :
Given the advance of Internet technologies, we can now easily extract hundreds or thousands of news stories of any ongoing incidents from newswires such as CNN.com, but the volume of information is too large for us to capture the blueprint. Information retrieval techniques such as topic detection and tracking are able to organize news stories as events, in a flat hierarchical structure, within a topic. However, they are incapable of presenting the complex evolution relationships between the events. We are interested to learn not only what the major events are but also how they develop within the topic. It is beneficial to identify the seminal events, the intermediary and ending events, and the evolution of these events. In this paper, we propose to utilize the event timestamp, event content similarity, temporal proximity, and document distributional proximity to model the event evolution relationships between events in an incident. An event evolution graph is constructed to present the underlying structure of events for efficient browsing and extracting of information. Case study and experiments are presented to illustrate and show the performance of our proposed technique. It is found that our proposed technique outperforms the baseline technique and other comparable techniques in previous work.
Keywords :
Internet; graph theory; information retrieval; text analysis; Internet; document distributional proximity; event content similarity; event evolution graph; event timestamp; information retrieval technique; news stories; temporal proximity; topic detection; topic tracking; Event evolution; event evolution graph; graph pruning; topic detection and tracking (TDT);
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4427
Type :
jour
DOI :
10.1109/TSMCA.2009.2015885
Filename :
4909011
Link To Document :
بازگشت