Title :
Research on N-gram-based malicious code feature extraction algorithm
Author :
Fang, Luo ; Qingyu, Ou ; Guoheng, Wei
Author_Institution :
Dept. of Inf. Security, Naval Univ. of Eng., Wuhan, China
Abstract :
The amount of computer virus is on the increase since its first appearance and has posed serious security threats to the computer systems. Most of the current anti-virus systems attempt to detect these new malicious programs through heuristics scheme, but this costs a lot and is often ineffective. In this paper, an N-gram-based malicious code feature extraction algorithm, based on statistical language model, is presented. Through this algorithm, the N-gram features of the sample set can be extracted and the features of the malicious code can be obtained exactly. Compared with the traditional feature code-based approaches, our approach has higher detection rates for new malicious codes.
Keywords :
computer viruses; feature extraction; heuristic programming; N-gram-based malicious code feature extraction algorithm; computer system security; computer virus; heuristics scheme; statistical language model; Feature vector; Malicious code detection; N-gram; Statistical language model;
Conference_Titel :
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-7235-2
Electronic_ISBN :
978-1-4244-7237-6
DOI :
10.1109/ICCASM.2010.5619983