• DocumentCode
    614107
  • Title

    MCF: MultiComponent Features for Malware Analysis

  • Author

    Vinod, P. ; Laxmi, V. ; Gaur, M.S. ; Naval, S. ; Faruki, Parvez

  • Author_Institution
    Dept. of Comput. Eng., Malaviya Nat. Inst. of Technol., Jaipur, India
  • fYear
    2013
  • fDate
    25-28 March 2013
  • Firstpage
    1076
  • Lastpage
    1081
  • Abstract
    In this paper, we use machine learning techniques for classifying a Portable Executable (PE) file as malware or benign. This is achieved by extracting a new feature also referred to us as MultiComponent Feature composed of (a) PE metadata (b) Principal Instruction Code (PIC)(c) mnemonic bi-gram and (d) prominent unigrams that characterizes malware/benign files. Reduced feature set are obtained using feature selection and reduction methods such as Minimum Redundancy and Maximum Relevance (mRMR), Principal Component Analysis (PCA) and prominent EigenVector Feature (EVF). We demonstrate that amongst mRMR, PCA and EVF, mRMR feature selection method is suitable for extracting optimal PE attributes. The performance of our proposed method is compared with similar work reported in previous literature´s and we have found that the detection rate with our methodology is found to be better compared to prior work. This suggest that the proposed method can be used effectively for the identification of malicious files.
  • Keywords
    eigenvalues and eigenfunctions; invasive software; learning (artificial intelligence); principal component analysis; EVF; MCF; PCA; PE metadata; PIC; eigenvector feature; fature reduction; mRMR feature selection; machine learning; malicious files identification; malware analysis; minimum redundancy-maximum relevance; mnemonic bigram; multicomponent features; portable executable file; principal component analysis; principal instruction code; prominent unigram; Accuracy; Data mining; Feature extraction; Malware; Principal component analysis; Radio frequency; Vectors; classifiers; eigen vectors; features; mRMR; malware; principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops (WAINA), 2013 27th International Conference on
  • Conference_Location
    Barcelona
  • Print_ISBN
    978-1-4673-6239-9
  • Electronic_ISBN
    978-0-7695-4952-1
  • Type

    conf

  • DOI
    10.1109/WAINA.2013.147
  • Filename
    6550538