Title :
Exploratory experiments to identify fake websites by using features from the network stack
Author :
Koepke, Jason ; Kaza, Siddharth ; Abbasi, Ahmed
Author_Institution :
Dept. of Comput. & Inf. Sci., Towson Univ., Towson, MD, USA
Abstract :
Users on the web are unknowingly becoming more susceptible to scams from cyber deviants and malicious websites. There has been much work in the identification of malicious websites using application layer features based on content (HTML, images, links, etc.) and a plethora of classification techniques. However, there has been little work on using features from the other layers in the Open Systems Interconnection (OSI) network stack. Capturing features from the transport and Internet layers of the network stack based on responses to various Hypertext Transfer Protocol (HTTP) requests may allow for increased classification accuracy. In this paper, we use learning techniques (Winnow, Logit Regression, Naïve Bayes, J48, and Bayesian) utilizing these new features to identify fake pharmacy websites. The results show that using transport and Internet layer features yields an accuracy of 80% to 95% for detecting fake websites using standard machine learning algorithms. The results suggest that many organizations may be hosting multiple websites using shared code and hosting services, enabling them to produce the maximum number of fraudulent websites.
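As a rough illustration of the approach the abstract describes, the sketch below trains a small Gaussian Naïve Bayes classifier (one of the learners named above) on network-stack features. The abstract does not list the actual features, so the three used here (IP time-to-live, TCP window size, TCP maximum segment size), the toy values, and the helper names `fit` and `predict` are all assumptions made for illustration, not the authors' feature set, data, or implementation.

```python
import math
from collections import defaultdict

# ASSUMPTION: the three features below (IP time-to-live, TCP window size,
# TCP maximum segment size) and every value are made up for illustration.
# They are not the paper's feature set or data.
TRAIN = [
    ((64, 5840, 1460), "fake"),
    ((64, 5792, 1460), "fake"),
    ((63, 5840, 1452), "fake"),
    ((128, 8192, 1460), "legit"),
    ((127, 16384, 1380), "legit"),
    ((128, 65535, 1460), "legit"),
]


def fit(samples):
    """Estimate a class prior and per-feature mean/variance for each label."""
    rows_by_label = defaultdict(list)
    for features, label in samples:
        rows_by_label[label].append(features)

    model = {}
    for label, rows in rows_by_label.items():
        stats = []
        for column in zip(*rows):
            mean = sum(column) / len(column)
            # Small floor keeps the variance positive when a feature is constant.
            var = sum((x - mean) ** 2 for x in column) / len(column) + 1e-9
            stats.append((mean, var))
        model[label] = (len(rows) / len(samples), stats)
    return model


def predict(model, features):
    """Return the label with the highest Gaussian log-posterior."""
    def log_posterior(label):
        prior, stats = model[label]
        score = math.log(prior)
        for x, (mean, var) in zip(features, stats):
            score += -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)
        return score

    return max(model, key=log_posterior)


model = fit(TRAIN)
print(predict(model, (64, 5840, 1460)))
print(predict(model, (128, 8192, 1460)))
```

In practice the feature vectors would come from packet captures of the responses to HTTP requests, and a toolkit implementation of the listed learners would likely replace this hand-written one; the sketch only shows how transport and Internet layer values can feed a standard classifier.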
Keywords :
Internet; Web sites; hypermedia markup languages; security of data; HTTP; Internet layers; OSI; classification techniques; cyber deviants; exploratory experiments; fake websites identification; hypertext transfer protocol; learning techniques; malicious websites; network stack; open systems interconnection; pharmacy websites; Accuracy; Classification algorithms; IP networks; Machine learning algorithms; Protocols; Servers; fake websites; machine learning; web mining; website signatures
Conference_Title :
Intelligence and Security Informatics (ISI), 2012 IEEE International Conference on
Conference_Location :
Arlington, VA
Print_ISBN :
978-1-4673-2105-1
DOI :
10.1109/ISI.2012.6284144