DocumentCode :
29829
Title :
SVM Training Phase Reduction Using Dataset Feature Filtering for Malware Detection
Author :
O´Kane, P. ; Sezer, Sakir ; McLaughlin, Keiran ; Eul Gyu Im
Author_Institution :
Centre for Secure Inf. Technol., Queen´´s Univ. Belfast, Belfast, UK
Volume :
8
Issue :
3
fYear :
2013
fDate :
Mar-13
Firstpage :
500
Lastpage :
509
Abstract :
N-gram analysis is an approach that investigates the structure of a program using bytes, characters, or text strings. A key issue with N-gram analysis is feature selection amidst the explosion of features that occurs when N is increased. The experiments within this paper represent programs as operational code (opcode) density histograms gained through dynamic analysis. A support vector machine is used to create a reference model, which is used to evaluate two methods of feature reduction, which are “area of intersect” and “subspace analysis using eigenvectors.” The findings show that the relationships between features are complex and simple statistics filtering approaches do not provide a viable approach. However, eigenvector subspace analysis produces a suitable filter.
Keywords :
eigenvalues and eigenfunctions; invasive software; statistical analysis; support vector machines; N-gram analysis; SVM training phase reduction; dataset feature filtering; density histograms; eigenvector subspace analysis; feature reduction; feature selection; malware detection; reference model; statistics filtering; support vector machine; Filtering; Kernel; Malware; Materials; Support vector machines; Training; KNN; SVM; metamorphism malware; obfuscation; packers; polymorphism;
fLanguage :
English
Journal_Title :
Information Forensics and Security, IEEE Transactions on
Publisher :
ieee
ISSN :
1556-6013
Type :
jour
DOI :
10.1109/TIFS.2013.2242890
Filename :
6420939
Link To Document :
بازگشت