DocumentCode :
1791677
Title :
WS2F: A weakly supervised framework for data stream filtering
Author :
Cailing Dong ; Agarwal, Abhishek
Author_Institution :
Dept. of Inf. Syst., Univ. of Maryland, Baltimore, MD, USA
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
50
Lastpage :
57
Abstract :
In this paper we present a weakly supervised framework for relevant content filtering from social media platforms such as Twitter. Social media platforms are a rich source of information these days. However of all the available information, there is only a small fraction of which is of general interest. Most of the other information pertains to personal events, and is very specific to the users who are contributing that. It is therefore usually not of general interest. In this paper, we present a framework to filter out the topic-specific relevant information from the irrelevant information in the stream of text provided by social media platforms. Our framework does not depend on any labeled data, however it is capable of using domain knowledge in the form of rules and guidelines provided by domain experts. It is therefore easily extensible for new topics and events. The proposed framework is built keeping the streaming nature of social media platforms in mind, i.e., it is able to discover the content relevant to a specific event as it evolves in the text stream. Because of its adaptive nature, it is not only able to filter the relevant content, but also able to generate event story lines as the event evolves. We experiment on a dataset provided by TREC, and show that the framework not only filters relevant content for an event but also generates its story line effectively.
Keywords :
social networking (online); TREC; WS2F; data stream filtering; social media platforms; weakly supervised framework; Correlation; Media; Reliability; Silicon; Testing; Training data; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004356
Filename :
7004356
Link To Document :
بازگشت