• DocumentCode
    679821
  • Title

    Detection of phishing URLs using machine learning techniques

  • Author

    James, Jesin ; Sandhya, L. ; Thomas, Cedric

  • Author_Institution
    SCT Coll. of Eng., Trivandrum, India
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    304
  • Lastpage
    309
  • Abstract
    Phishing costs Internet users billions of dollars per year. It refers to luring techniques used by identity thieves to fish for personal information in a pond of unsuspecting Internet users. Phishers use spoofed e-mail, phishing software to steal personal information and financial account details such as usernames and passwords. This paper deals with methods for detecting phishing Web sites by analyzing various features of benign and phishing URLs by Machine learning techniques. We discuss the methods used for detection of phishing Web sites based on lexical features, host properties and page importance properties. We consider various data mining algorithms for evaluation of the features in order to get a better understanding of the structure of URLs that spread phishing. The fine-tuned parameters are useful in selecting the apt machine learning algorithm for separating the phishing sites from benign sites.
  • Keywords
    Web sites; authorisation; data mining; learning (artificial intelligence); unsolicited e-mail; Internet users; benign sites; data mining algorithms; financial account stealing; host properties; lexical feature analysis; machine learning algorithm selection; page importance properties; passwords; personal information stealing; phishing URL detection; phishing Web site detection; phishing software; spoofed e-mail; usernames; Classification algorithms; Electronic mail; Feature extraction; Google; Internet; MATLAB; Web pages; Page rank; Phishing; URL; WHOIS; benign;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Communication and Computing (ICCC), 2013 International Conference on
  • Conference_Location
    Thiruvananthapuram
  • Print_ISBN
    978-1-4799-0573-7
  • Type

    conf

  • DOI
    10.1109/ICCC.2013.6731669
  • Filename
    6731669