• DocumentCode
    669119
  • Title

    Dynamic classification of packing algorithms for inspecting executables using entropy analysis

  • Author

    Bat-Erdene, Munkhbayar ; Taebeom Kim ; Hongzhe Li ; Heejo Lee

  • Author_Institution
    Div. of Comput. & Commun. Eng., Korea Univ., Seoul, South Korea
  • fYear
    2013
  • fDate
    22-24 Oct. 2013
  • Firstpage
    19
  • Lastpage
    26
  • Abstract
    Packing is widely used for bypassing anti-malware systems, and the proportion of packed malware has been growing rapidly, making up over 80% of malware. Few studies on detecting packing algorithms have been conducted during last two decades. In this paper, we propose a method to classify packing algorithms of given packed executables. First, we convert entropy values of the packed executables loaded in memory into symbolic representations. Our proposed method uses SAX (Symbolic Aggregate Approximation) which is known to be good at large data conversion. Due to its advantage of simplifying complicated patterns, symbolic representation is commonly used in bio-informatics and data mining fields. Second, we classify the distribution of symbols using supervised learning classifications, i.e., Naive Bayes and Support Vector Machines. Results of our experiments with a collection of 466 programs and 15 packing algorithms demonstrated that our method can identify packing algorithms of given executables with a high accuracy of 94.2%, recall of 94.7% and precision of 92.7%. It has been confirmed that packing algorithms can be identified using entropy analysis, which is a measure of uncertainty of running executables, without a prior knowledge of the executable.
  • Keywords
    electronic data interchange; entropy; invasive software; learning (artificial intelligence); pattern classification; symbol manipulation; SAX; anti-malware systems; data conversion; entropy analysis; entropy values; executables inspection; packed malware; packing algorithms dynamic classification; running executables uncertainty measurement; supervised learning classifications; symbolic aggregate approximation; symbolic representations; Algorithm design and analysis; Classification algorithms; Entropy; Malware; Software algorithms; Support vector machines; Time series analysis; Entropy Analysis; Original Entry Point (OEP); Packing Algorithms; Piecewise Aggregate Approximation (PAA); Symbolic Aggregate Approximation (SAX);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Malicious and Unwanted Software: "The Americas" (MALWARE), 2013 8th International Conference on
  • Conference_Location
    Fajardo, PR
  • Print_ISBN
    978-1-4799-2534-6
  • Type

    conf

  • DOI
    10.1109/MALWARE.2013.6703681
  • Filename
    6703681