DocumentCode :
2119643
Title :
Extraction and Compilation of Events and Sub-events from Twitter
Author :
Khurdiya, Arpit ; Dey, Lipika ; Mahajan, Dhruv ; Verma, Ishan
Author_Institution :
TCS Innovation Labs., Tata Consultancy Services Ltd., Delhi, India
Volume :
1
fYear :
2012
fDate :
4-7 Dec. 2012
Firstpage :
504
Lastpage :
508
Abstract :
Twitter has emerged as a great source to provide insights about upcoming planned and unplanned events of social, economic and political relevance. Big events are publicized and known in advance, but smaller, unplanned sub-events around them are not always advertised. These unplanned events may have a large localized impact. If known in advance, knowledge about events like threats, protests, demonstrations etc. or even about large flash mobs can be utilized by planners and event managers. Given the large volumes of tweets floating around at any given time, identifying relevant sub-events is a non-trivial task. In this paper, we explore machine learning techniques to identify, extract and build a map of small sub-events around a big, popular event. We use CRFs to extract event components from tweets. Events are resolved for uniqueness and compiled into a complete calendar. The model is evaluated on tweets around Olympic Games. The framework is generic enough to be adapted to other domains.
Keywords :
learning (artificial intelligence); social networking (online); sport; CRF; Olympic Games; Twitter; event compilation; event extraction; machine learning techniques; sub-event identification; tweets; Conditional Random Fields; Entity Resolution; Event Extraction; Social Media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
Type :
conf
DOI :
10.1109/WI-IAT.2012.192
Filename :
6511931
Link To Document :
بازگشت