Title :
Phishing detection using classifier ensembles
Author :
Toolan, Fergus ; Carthy, Joe
Author_Institution :
Sch. of Comput. Sci. & Inf., Univ. Coll. Dublin, Dublin, Ireland
fDate :
Sept. 20 2009-Oct. 21 2009
Abstract :
This paper introduces an approach to classifying emails into phishing/non-phishing categories using the C5.0 algorithm which achieves very high precision and an ensemble of other classifiers that achieve high recall. The representation of instances used in this paper is very small consisting of only five features. Results of an evaluation of this system, using over 8,000 emails approximately half of which were phishing emails and the remainder legitimate, are presented. These results show the benefits of using this recall boosting technique over that of any individual classifier or collection of classifiers.
Keywords :
Web sites; classification; computer crime; unsolicited e-mail; C5.0 algorithm; classifier ensembles; email classification; phishing detection; recall boosting; Clustering algorithms; Computer security; Credit cards; Informatics; Information security; Information technology; Internet; Laboratories; Reliability engineering; Uniform resource locators;
Conference_Titel :
eCrime Researchers Summit, 2009. eCRIME '09.
Conference_Location :
Tacoma, WA
Print_ISBN :
978-1-4244-4625-4
DOI :
10.1109/ECRIME.2009.5342607