Title :
LeadLine: Interactive visual analysis of text data through event identification and exploration
Author :
Wenwen Dou ; Xiaoyu Wang ; Skau, D. ; Ribarsky, William ; Zhou, Michelle X.
Author_Institution :
Univ. of North Carolina at Charlotte, Charlotte, NC, USA
Abstract :
Text data such as online news and microblogs bear valuable insights regarding important events and responses to such events. Events are inherently temporal, evolving over time. Existing visual text analysis systems have provided temporal views of changes based on topical themes extracted from text data. But few have associated topical themes with events that cause the changes. In this paper, we propose an interactive visual analytics system, LeadLine, to automatically identify meaningful events in news and social media data and support exploration of the events. To characterize events, LeadLine integrates topic modeling, event detection, and named entity recognition techniques to automatically extract information regarding the investigative 4 Ws: who, what, when, and where for each event. To further support analysis of the text corpora through events, LeadLine allows users to interactively examine meaningful events using the 4 Ws to develop an understanding of how and why. Through representing large-scale text corpora in the form of meaningful events, LeadLine provides a concise summary of the corpora. LeadLine also supports the construction of simple narratives through the exploration of events. To demonstrate the efficacy of LeadLine in identifying events and supporting exploration, two case studies were conducted using news and social media data.
Keywords :
information retrieval; social networking (online); text analysis; LeadLine; automatically identify meaningful events; entity recognition techniques; event exploration; event identification; information extraction; interactive visual analysis; interactive visual analytics system; large-scale text corpora; microblogs; news data; online news; social media data; support exploration; text data; visual text analysis systems; Crawlers; Data mining; Event detection; Lead; Time series analysis; Twitter; Visualization;
Conference_Titel :
Visual Analytics Science and Technology (VAST), 2012 IEEE Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
978-1-4673-4752-5
DOI :
10.1109/VAST.2012.6400485