DocumentCode :
1619244
Title :
High-Performance Classification of Phishing URLs Using a Multi-modal Approach with MapReduce
Author :
Shrestha, Niju ; Kharel, Rajan Kumar ; Britt, Jason ; Hasan, Ragib
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. of Alabama at Birmingham, Birmingham, AL, USA
fYear :
2015
Firstpage :
206
Lastpage :
212
Abstract :
Classifying phishing websites can be expensive, both computationally and financially, given a large enough volume of suspect sites. A distributed cloud environment can significantly reduce both the computation time and the financial cost. To test this idea, we apply a multi-modal feature classification algorithm to classify phishing websites in one non-distributed environment and several distributed environments. The multi-modal approach combines visual and text features for classification. The implementation extracts color and histogram features from a screenshot of a phishing website and text from its HTML source code. Feature extraction and comparison are accomplished using the MapReduce framework. Implementing the multi-modal approach in a distributed environment reduces both the runtime and the financial cost. We present results showing that our work is 30 times faster than existing state-of-the-art systems for the phishing website classification problem.
Keywords :
Web sites; computer crime; data handling; feature extraction; parallel programming; MapReduce framework; URL; color feature extraction; distributed cloud environment; high-performance classification; histogram feature extraction; multimodal approach; multimodal feature classification algorithm; phishing Website classification problem; Classification algorithms; Color; Feature extraction; Histograms; Image color analysis; Visualization; Web pages; Color code; Map Reduce; Phishing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Services (SERVICES), 2015 IEEE World Congress on
Conference_Location :
New York City, NY
Print_ISBN :
978-1-4673-7274-9
Type :
conf
DOI :
10.1109/SERVICES.2015.38
Filename :
7196526