Title :
Automatic Detection of Phishing Target from Phishing Webpage
Author :
Liu, Gang ; Qiu, Bite ; Wenyin, Liu
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
Abstract :
An approach to identification of the phishing target of a given (suspicious) webpage is proposed by clustering the webpage set consisting of its all associated webpages and the given webpage itself. We first find its associated webpages, and then explore their relationships to the given webpage as their features for clustering. Such relationships include link relationship, ranking relationship, text similarity, and webpage layout similarity relationship. A DBSCAN clustering method is employed to find if there is a cluster around the given webpage. If such cluster exists, we claim the given webpage is a phishing webpage and then find its phishing target (i.e., the legitimate webpage it is attacking) from this cluster. Otherwise, we identify it as a legitimate webpage. Our test dataset consists of 8745 phishing pages (targeting at 76 well-known websites) selected from Phish Tank and preliminary experiments show that the approach can successfully identify 91.44% of their phishing targets. Another dataset of 1000 legitimate webpages is collected to test our method´s false alarm rate, which is 3.40%.
Keywords :
Internet; computer crime; pattern clustering; text analysis; DBSCAN clustering method; Webpage layout similarity relationship; Webpage set clustering; legitimate Webpage; link relationship; phishing Webpage; phishing target automatic detection; ranking relationship; text similarity; Accuracy; Clustering algorithms; Data mining; Feature extraction; Layout; Object detection; Visualization; Anti-Phishing; DBSCAN Clustering; Phishing; Web Document Analysis;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.1010