• DocumentCode
    3725236
  • Title

    Investigating the effect of feature selection and dimensionality reduction on phishing website classification problem

  • Author

    Pradeep Singh;Niti Jain;Ambar Maini

  • Author_Institution
    Department of Computer Science, National Institute of Technology, Raipur, India
  • fYear
    2015
  • Firstpage
    388
  • Lastpage
    393
  • Abstract
    Phishing is a term given to the method of gaining unauthorized access to a person´s private information like passwords, account or credit card details. It is a deception technique that utilizes social engineering & technology to convince a victim to provide personal information, usually for monetary benefits. Phishing attacks have become frequent and involve the risk of identity theft and financial losses. Detection of phishing website has become very important for online banking and e-commerce users. We proposed an effective model that is based on preprocessing (Feature selection and dimensionality reduction) and classification DataMining algorithms. These algorithms were used to characterize and identify all the factors to classify the phishing website. We implemented five different classification algorithm and four preprocessing techniques to classify a websites legitimate or phishy. We also compared their respective performances in terms of accuracy and AUC.
  • Keywords
    "Principal component analysis","Support vector machines","Postal services","Vegetation"
  • Publisher
    ieee
  • Conference_Titel
    Next Generation Computing Technologies (NGCT), 2015 1st International Conference on
  • Type

    conf

  • DOI
    10.1109/NGCT.2015.7375147
  • Filename
    7375147