DocumentCode :
1822900
Title :
Exploiting hashtags for adaptive microblog crawling
Author :
Xinyue Wang ; Tokarchuk, Laurissa ; Cuadrado, Felix ; Poslad, Stefan
Author_Institution :
Sch. of Electron. Eng. & Comput. Sci., Queen Mary, Univ. of London, London, UK
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
311
Lastpage :
315
Abstract :
Researchers have capitalized on microblogging services, such as Twitter, for detecting and monitoring real world events. Existing approaches have based their conclusions on data collected by monitoring a set of pre-defined keywords. In this paper, we show that this manner of data collection risks losing a significant amount of relevant information. We then propose an adaptive crawling model that detects emerging popular hashtags, and monitors them to retrieve greater amounts of highly associated data for events of interest. The proposed model analyzes the traffic patterns of the hashtags collected from the live stream to update subsequent collection queries. To evaluate this adaptive crawling model, we apply it to a dataset collected during the 2012 London Olympic Games. Our analysis shows that adaptive crawling based on the proposed Refined Keyword Adaptation algorithm collects a more comprehensive dataset than pre-defined keyword crawling, while only introducing a minimum amount of noise.
Keywords :
data acquisition; query processing; search engines; social networking (online); 2012 London Olympic Games dataset; Twitter; adaptive crawling model; adaptive microblog crawling; collection queries; data collection; hashtags; live stream; microblogging services; predefined keywords; real world event detection; real world event monitoring; refined keyword adaptation algorithm; traffic patterns; Adaptation models; Correlation; Crawlers; Media; Noise; Twitter; Data Crawler; Hashtag; Social Network; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on
Conference_Location :
Niagara Falls, ON
Type :
conf
Filename :
6785725
Link To Document :
بازگشت