DocumentCode :
166419
Title :
Approximate string matching algorithm for phishing detection
Author :
Abraham, Dona ; Raj, Nisha S.
Author_Institution :
Dept. of Comput. Sci. & Eng., SCMS Sch. of Eng. & Technol., Ernakulam, India
fYear :
2014
fDate :
24-27 Sept. 2014
Firstpage :
2285
Lastpage :
2290
Abstract :
Phishing is an act of stealing personal and sensitive user information through internet and using it for financial transactions. The goal of phishers is to carry out fraudulent transactions on behalf of the victims by using the information stealed from them. Availing the services of internet has become a dangerous task to the common people with these kinds of attacks. Many methods have been developed to fight against phishing attacks. But, as the attacker uses more sophisticated techniques each method fails to perform well in detecting the attacks. Here we propose a string matching method for detecting phishing attacks, which determines the degree of similarity a URL is having with the blacklisted URLs. Thus based on the textual properties of a URL it can be classified as phishing or non-phishing. Two string matching algorithms i.e. Longest Common Subsequence (LCS) and Edit Distance are used in the hostname comparison. The accuracy rate obtained for LCS is 99.1% and for Edit Distance it is 99.5%.
Keywords :
Internet; computer crime; string matching; text analysis; unsolicited e-mail; Internet; LCS; URL textual properties; approximate string matching algorithm; edit distance; fraudulent transactions; longest common subsequence; personal information stealing; phishing attack detection; sensitive user information stealing; Accuracy; Electronic mail; Feature extraction; IP networks; Internet; Training; Uniform resource locators; Approximate String matching; Blacklist; Edit Distance; Longest Common Subsequence(LCS); Phishing Attacks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-3078-4
Type :
conf
DOI :
10.1109/ICACCI.2014.6968578
Filename :
6968578
Link To Document :
بازگشت