• DocumentCode
    6600
  • Title

    Malwise—An Effective and Efficient Classification System for Packed and Polymorphic Malware

  • Author

    Cesare, Silvio ; Yang Xiang ; Wanlei Zhou

  • Author_Institution
    Sch. of Inf. Technol., Deakin Univ., Melbourne, VIC, Australia
  • Volume
    62
  • Issue
    6
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    1193
  • Lastpage
    1206
  • Abstract
    Signature-based malware detection systems have been a much used response to the pervasive problem of malware. Identification of malware variants is essential to a detection system and is made possible by identifying invariant characteristics in related samples. To classify the packed and polymorphic malware, this paper proposes a novel system, named Malwise, for malware classification using a fast application-level emulator to reverse the code packing transformation, and two flowgraph matching algorithms to perform classification. An exact flowgraph matching algorithm is employed that uses string-based signatures, and is able to detect malware with near real-time performance. Additionally, a more effective approximate flowgraph matching algorithm is proposed that uses the decompilation technique of structuring to generate string-based signatures amenable to the string edit distance. We use real and synthetic malware to demonstrate the effectiveness and efficiency of Malwise. Using more than 15,000 real malware, collected from honeypots, the effectiveness is validated by showing that there is an 88 percent probability that new malware is detected as a variant of existing malware. The efficiency is demonstrated from a smaller sample set of malware where 86 percent of the samples can be classified in under 1.3 seconds.
  • Keywords
    digital signatures; graph theory; invasive software; pattern classification; pattern matching; Malwise system; application-level emulator; code packing transformation; decompilation technique; flowgraph matching algorithm; honeypots; malware classification; malware variant identification; packed malware; polymorphic malware; signature-based malware detection system; string edit distance; string-based signature; Approximation algorithms; Classification algorithms; Databases; Emulation; Entropy; Flow graphs; Malware; Computer security; control flow; malware; structural classification; structured control flow; unpacking;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2012.65
  • Filename
    6171162