DocumentCode :
1799745
Title :
Visual Similarity Based Anti-phishing with the Combination of Local and Global Features
Author :
Yu Zhou ; Yongzheng Zhang ; Jun Xiao ; Yipeng Wang ; Weiyao Lin
Author_Institution :
Inst. of Inf. Eng., Beijing, China
fYear :
2014
fDate :
24-26 Sept. 2014
Firstpage :
189
Lastpage :
196
Abstract :
Phishing uses a fake Web page to steal sensitive personal information such as credit card numbers and passwords. Generally, the fake Web page is visually similar to the legitimate target Web page. Phishers can obtain financial benefits from this information. Anti-phishing is very important for a variety of applications such as defense against phishing attacks, online transaction security, and user privacy protection. In this paper, we propose a novel and effective visual similarity based phishing detection approach that compares the snapshot images of the suspected Web page and the protected Web page. The proposed approach is based on the key insight that the local and global features of the Web page image can together represent the visual characteristics of the Web page. This approach operates purely at the image level, and thus can effectively deal with non-text phishing tricks including images or Flash objects in the HTML content. For the local feature, the presence of the target logo is detected. For the global feature, the similarity of the visible part of the Web page is considered. We implemented and evaluated the proposed approach on a large-scale dataset consisting of 2,129 real-world phishing Web pages and 1,367 irrelevant legitimate Web pages. The experimental results show that the proposed approach can achieve over 90.00% true positive rate and 97.00% true negative rate. Our approach has been applied in the anti-phishing project of a major Internet Service Provider and provides periodic reports to potential users.
Keywords :
Web sites; computer crime; feature extraction; HTML contents; Internet service provider; Web page image; credit card numbers; fake Web page; global features; image level; large-scale dataset; legitimate target Web page; local features; nontext phishing tricks; online transaction security; passwords; personal sensitive information stealing; phishing attacks; protected Web page; snapshot image pair; suspected Web page; target logo detection; true negative rate; true positive rate; user privacy protection; visual characteristics representation; visual similarity antiphishing; visual similarity based phishing detection approach; Feature extraction; HTML; Image color analysis; Image resolution; Security; Visualization; Web pages; EMD Algorithm; Logo Detection; Phishing Detection; Visual Similarity;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Trust, Security and Privacy in Computing and Communications (TrustCom), 2014 IEEE 13th International Conference on
Conference_Location :
Beijing
Type :
conf
DOI :
10.1109/TrustCom.2014.28
Filename :
7011250