Title of article
Phishing website detection using weighted feature line embedding
Author/Authors
Imani, Maryam - Tarbiat Modares University; Montazer, Gholam Ali - Tarbiat Modares University
Pages
13
From page
49
To page
61
Abstract
Phishing aims to obtain users' private information without their permission by creating a website that mimics a trusted one. Information technology specialists do not agree on a single definition of the discriminative features that characterize phishing websites. Therefore, the number of reliable training samples in phishing detection problems is limited. Moreover, the available training samples include abnormal samples that cause classification errors: for instance, phishing samples may have features similar to legitimate ones, and vice versa. To address these problems, this paper proposes a supervised feature extraction method called weighted feature line embedding. The proposed method virtually generates training samples using the feature line metric, which addresses the small sample size problem. Moreover, by assigning appropriate weights to each pair of feature points, it corrects the undesirable effect of abnormal samples. The features extracted by the method improve phishing website detection performance, especially with small training sets.
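To make the feature line idea in the abstract concrete, the sketch below computes the standard feature line distance: a query point is projected onto the straight line through two feature points of the same class, and the projection acts as a virtual sample. This is only an illustration of the general metric, not the weighted embedding proposed in the paper; the abstract does not specify the weighting scheme, so none is shown. The function name `feature_line_projection` and the toy vectors are hypothetical.

```python
from typing import List, Sequence, Tuple


def feature_line_projection(
    x: Sequence[float], a: Sequence[float], b: Sequence[float]
) -> Tuple[List[float], float]:
    """Project x onto the line through feature points a and b.

    Returns the projection point (a virtual sample on the feature line)
    and the Euclidean distance from x to that point.
    """
    # Direction vector of the feature line.
    direction = [bk - ak for ak, bk in zip(a, b)]
    norm_sq = sum(d * d for d in direction)
    if norm_sq == 0.0:
        raise ValueError("feature points must be distinct")

    # Position parameter t: 0 gives a, 1 gives b, other values extrapolate.
    t = sum((xk - ak) * dk for xk, ak, dk in zip(x, a, direction)) / norm_sq

    projection = [ak + t * dk for ak, dk in zip(a, direction)]
    distance = sum((xk - pk) ** 2 for xk, pk in zip(x, projection)) ** 0.5
    return projection, distance


# Toy example with made-up 2-D feature vectors (assumed values, not paper data).
projection, distance = feature_line_projection([1.0, 1.0], [0.0, 0.0], [2.0, 0.0])
print(projection, distance)
```

Because every pair of same-class points defines a line, a small training set yields many such virtual points, which is the intuition behind using the metric when samples are scarce.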
Keywords
Phishing Detection, Feature Extraction, Feature Line, Virtual Training
Journal title
ISeCure, The ISC International Journal of Information Security
Serial Year
2017
Record number
2454672