Title :
WarningBird: A Near Real-Time Detection System for Suspicious URLs in Twitter Stream
Author :
Sangho Lee ; Jong Kim
Author_Institution :
Dept. of Comput. Sci. & Eng., Pohang Univ. of Sci. & Technol. (POSTECH), Pohang, South Korea
Abstract :
Twitter is prone to malicious tweets containing URLs for spar, phishing, and malware distribution. Conventional Twitter spar detection schemes utilize account features such as the ratio of tweets containing URLs and the account creation date, or relation features in the Twitter graph. These detection schemes are ineffective against feature fabrications or consume much time and resources. Conventional suspicious URL detection schemes utilize several features including lexical features of URLs, URL redirection, HTIUIL content, and dynamic behavior. However, evading techniques such as time-based evasion and crawler evasion exist. in this paper, we propose WARNINGBIRD, a suspicious URL detection system for Twitter. Our system investigates correlations of URL redirect chains extracted from several tweets. Because attackers have limited resources and usually reuse them, their URL redirect chains frequently share the same URLs. We develop methods to discover correlated URL redirect chains using the frequently shared URLs and to determine their suspiciousness. We collect numerous tweets from the Twitter public timeline and build a statistical classifier using them. Evaluation results show that our classifier accurately and efficiently detects suspicious URLs. We also present WARNINGBIRD as a near real-time system for classifying suspicious URLs in the Twitter stream.
Keywords :
computer crime; invasive software; pattern classification; social networking (online); statistical analysis; unsolicited e-mail; Twitter graph; Twitter public timeline; Twitter spam detection schemes; Twitter stream; WarningBird; correlated URL redirect chains; frequently shared URL; malicious tweets; malware distribution; near real-time detection system; phishing; statistical classifier; suspicious URL classification; suspicious URL detection system; Browsers; Crawlers; Feature extraction; IP networks; Servers; Training; Twitter; Suspicious URL; URL redirection; classification; conditional redirection; twitter;
Journal_Title :
Dependable and Secure Computing, IEEE Transactions on
DOI :
10.1109/TDSC.2013.3