DocumentCode
614107
Title
MCF: MultiComponent Features for Malware Analysis
Author
Vinod, P. ; Laxmi, V. ; Gaur, M.S. ; Naval, S. ; Faruki, Parvez
Author_Institution
Dept. of Comput. Eng., Malaviya Nat. Inst. of Technol., Jaipur, India
fYear
2013
fDate
25-28 March 2013
Firstpage
1076
Lastpage
1081
Abstract
In this paper, we use machine learning techniques for classifying a Portable Executable (PE) file as malware or benign. This is achieved by extracting a new feature also referred to us as MultiComponent Feature composed of (a) PE metadata (b) Principal Instruction Code (PIC)(c) mnemonic bi-gram and (d) prominent unigrams that characterizes malware/benign files. Reduced feature set are obtained using feature selection and reduction methods such as Minimum Redundancy and Maximum Relevance (mRMR), Principal Component Analysis (PCA) and prominent EigenVector Feature (EVF). We demonstrate that amongst mRMR, PCA and EVF, mRMR feature selection method is suitable for extracting optimal PE attributes. The performance of our proposed method is compared with similar work reported in previous literature´s and we have found that the detection rate with our methodology is found to be better compared to prior work. This suggest that the proposed method can be used effectively for the identification of malicious files.
Keywords
eigenvalues and eigenfunctions; invasive software; learning (artificial intelligence); principal component analysis; EVF; MCF; PCA; PE metadata; PIC; eigenvector feature; fature reduction; mRMR feature selection; machine learning; malicious files identification; malware analysis; minimum redundancy-maximum relevance; mnemonic bigram; multicomponent features; portable executable file; principal component analysis; principal instruction code; prominent unigram; Accuracy; Data mining; Feature extraction; Malware; Principal component analysis; Radio frequency; Vectors; classifiers; eigen vectors; features; mRMR; malware; principal component analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Information Networking and Applications Workshops (WAINA), 2013 27th International Conference on
Conference_Location
Barcelona
Print_ISBN
978-1-4673-6239-9
Electronic_ISBN
978-0-7695-4952-1
Type
conf
DOI
10.1109/WAINA.2013.147
Filename
6550538
Link To Document