• DocumentCode
    1782748
  • Title

    Towards effective feature selection in machine learning-based botnet detection approaches

  • Author

    Beigi, Elaheh Biglar ; Jazi, Hossein Hadian ; Stakhanova, Natalia ; Ghorbani, Ali A.

  • Author_Institution
    Inf. Security Center of Excellence, Univ. of New Brunswick, Fredericton, NB, Canada
  • fYear
    2014
  • fDate
    29-31 Oct. 2014
  • Firstpage
    247
  • Lastpage
    255
  • Abstract
    Botnets, as one of the most formidable cyber security threats, are becoming more sophisticated and resistant to detection. In spite of specific behaviors each botnet has, there exist adequate similarities inside each botnet that separate its behavior from benign traffic. Several botnet detection systems have been proposed based on these similarities. However, offering a solution for differentiating botnet traffic (even those using same protocol, e.g. IRC) from normal traffic is not trivial. Extraction of features in either host or network level to model a botnet has been one of the most popular methods in botnet detection. A subset of features, usually selected based on some intuitive understanding of botnets, is used by the machine learning algorithms to classify/ cluster botnet traffic. These approaches, tested against two or three botnet traces, have mostly showed satisfactory detection results. Even though, their effectiveness in detection of other botnets or real traffic remains in doubt. Additionally, effectiveness of different combination of features in terms of providing more detection coverage has not been fully studied. In this paper we revisit flow-based features employed in the existing botnet detection studies and evaluate their relative effectiveness. To ensure a proper evaluation we create a dataset containing a diverse set of botnet traces and background traffic.
  • Keywords
    invasive software; learning (artificial intelligence); botnet detection; botnet traffic; cyber security threat; feature selection; flow-based feature; machine learning; Accuracy; Feature extraction; IP networks; Peer-to-peer computing; Ports (Computers); Protocols; Security;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Network Security (CNS), 2014 IEEE Conference on
  • Conference_Location
    San Francisco, CA
  • Type

    conf

  • DOI
    10.1109/CNS.2014.6997492
  • Filename
    6997492