DocumentCode
2351140
Title
Design and Evaluation of a Real-Time URL Spam Filtering Service
Author
Thomas, Kurt ; Grier, Chris ; Ma, Justin ; Paxson, Vern ; Song, Dawn
Author_Institution
Univ. of California, Berkeley, CA, USA
fYear
2011
fDate
22-25 May 2011
Firstpage
447
Lastpage
462
Abstract
On the heels of the widespread adoption of web services such as social networks and URL shorteners, scams, phishing, and malware have become regular threats. Despite extensive research, email-based spam filtering techniques generally fall short for protecting other web services. To better address this need, we present Monarch, a real-time system that crawls URLs as they are submitted to web services and determines whether the URLs direct to spam. We evaluate the viability of Monarch and the fundamental challenges that arise due to the diversity of web service spam. We show that Monarch can provide accurate, real-time protection, but that the underlying characteristics of spam do not generalize across web services. In particular, we find that spam targeting email qualitatively differs in significant ways from spam campaigns targeting Twitter. We explore the distinctions between email and Twitter spam, including the abuse of public web hosting and redirector services. Finally, we demonstrate Monarch´s scalability, showing our system could protect a service such as Twitter -- which needs to process 15 million URLs/day -- for a bit under $800/day.
Keywords
Web services; information filtering; invasive software; social networking (online); unsolicited e-mail; Monarch scalability; Twitter spam; URL shorteners; email based spam filtering techniques; malware; phishing; public web hosting; real-time URL Spam filtering service; redirector services; scams; social networks; underlying characteristics; web services; Browsers; Electronic mail; Feature extraction; HTML; IP networks; Real time systems; Web services;
fLanguage
English
Publisher
ieee
Conference_Titel
Security and Privacy (SP), 2011 IEEE Symposium on
Conference_Location
Berkeley, CA
ISSN
1081-6011
Print_ISBN
978-1-4577-0147-4
Electronic_ISBN
1081-6011
Type
conf
DOI
10.1109/SP.2011.25
Filename
5958045
Link To Document