DocumentCode :
6600
Title :
Malwise—An Effective and Efficient Classification System for Packed and Polymorphic Malware
Author :
Cesare, Silvio ; Yang Xiang ; Wanlei Zhou
Author_Institution :
Sch. of Inf. Technol., Deakin Univ., Melbourne, VIC, Australia
Volume :
62
Issue :
6
fYear :
2013
fDate :
Jun-13
Firstpage :
1193
Lastpage :
1206
Abstract :
Signature-based malware detection systems have been a much used response to the pervasive problem of malware. Identification of malware variants is essential to a detection system and is made possible by identifying invariant characteristics in related samples. To classify the packed and polymorphic malware, this paper proposes a novel system, named Malwise, for malware classification using a fast application-level emulator to reverse the code packing transformation, and two flowgraph matching algorithms to perform classification. An exact flowgraph matching algorithm is employed that uses string-based signatures, and is able to detect malware with near real-time performance. Additionally, a more effective approximate flowgraph matching algorithm is proposed that uses the decompilation technique of structuring to generate string-based signatures amenable to the string edit distance. We use real and synthetic malware to demonstrate the effectiveness and efficiency of Malwise. Using more than 15,000 real malware, collected from honeypots, the effectiveness is validated by showing that there is an 88 percent probability that new malware is detected as a variant of existing malware. The efficiency is demonstrated from a smaller sample set of malware where 86 percent of the samples can be classified in under 1.3 seconds.
Keywords :
digital signatures; graph theory; invasive software; pattern classification; pattern matching; Malwise system; application-level emulator; code packing transformation; decompilation technique; flowgraph matching algorithm; honeypots; malware classification; malware variant identification; packed malware; polymorphic malware; signature-based malware detection system; string edit distance; string-based signature; Approximation algorithms; Classification algorithms; Databases; Emulation; Entropy; Flow graphs; Malware; Computer security; control flow; malware; structural classification; structured control flow; unpacking;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2012.65
Filename :
6171162
Link To Document :
بازگشت