• DocumentCode
    586478
  • Title

    LeadLine: Interactive visual analysis of text data through event identification and exploration

  • Author

    Wenwen Dou ; Xiaoyu Wang ; Skau, D. ; Ribarsky, William ; Zhou, Michelle X.

  • Author_Institution
    Univ. of North Carolina at Charlotte, Charlotte, NC, USA
  • fYear
    2012
  • fDate
    14-19 Oct. 2012
  • Firstpage
    93
  • Lastpage
    102
  • Abstract
    Text data such as online news and microblogs bear valuable insights regarding important events and responses to such events. Events are inherently temporal, evolving over time. Existing visual text analysis systems have provided temporal views of changes based on topical themes extracted from text data. But few have associated topical themes with events that cause the changes. In this paper, we propose an interactive visual analytics system, LeadLine, to automatically identify meaningful events in news and social media data and support exploration of the events. To characterize events, LeadLine integrates topic modeling, event detection, and named entity recognition techniques to automatically extract information regarding the investigative 4 Ws: who, what, when, and where for each event. To further support analysis of the text corpora through events, LeadLine allows users to interactively examine meaningful events using the 4 Ws to develop an understanding of how and why. Through representing large-scale text corpora in the form of meaningful events, LeadLine provides a concise summary of the corpora. LeadLine also supports the construction of simple narratives through the exploration of events. To demonstrate the efficacy of LeadLine in identifying events and supporting exploration, two case studies were conducted using news and social media data.
  • Keywords
    information retrieval; social networking (online); text analysis; LeadLine; automatically identify meaningful events; entity recognition techniques; event exploration; event identification; information extraction; interactive visual analysis; interactive visual analytics system; large-scale text corpora; microblogs; news data; online news; social media data; support exploration; text data; visual text analysis systems; Crawlers; Data mining; Event detection; Lead; Time series analysis; Twitter; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Visual Analytics Science and Technology (VAST), 2012 IEEE Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    978-1-4673-4752-5
  • Type

    conf

  • DOI
    10.1109/VAST.2012.6400485
  • Filename
    6400485