• DocumentCode
    1845321
  • Title

    Characterizing Obfuscated JavaScript Using Abstract Syntax Trees: Experimenting with Malicious Scripts

  • Author

    Blanc, Gregory ; Miyamoto, Daisuke ; Akiyama, Mitsuaki ; Kadobayashi, Youki

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma, Japan
  • fYear
    2012
  • fDate
    26-29 March 2012
  • Firstpage
    344
  • Lastpage
    351
  • Abstract
    Obfuscation, code transformations that make the code unintelligible, is still an issue for web malware analysts and is still a weapon of choice for attackers. Worse, some researchers have arbitrarily decided to consider obfuscated contents as malicious although it has been proven wrong. Yet, we can assume than some web attack kits only feature a fraction of existing obfuscating transformations which may make it easy to detect malicious scripting contents. However, because of the undecidability on obfuscated contents, we propose to survey, classify and design deobfuscation methods for each obfuscating transformation. In this paper, we apply abstract syntax tree (AST) based methods to characterize obfuscating transformations found in malicious JavaScript samples. We are able to classify similar obfuscated codes based on AST fingerprints regardless of the original attack code. We are also able to quickly detect these obfuscating transformations by matching these in an analyzed sample´s AST using a pushdown automaton (PDA). The PDA accepts a set of sub trees representing obfuscating transformations previously learned. Such quick and lightweight sub tree matching algorithm has the potential to detect obfuscated pieces of code in a script, to be later extracted for deobfuscation.
  • Keywords
    Java; Web services; computational linguistics; invasive software; pattern matching; pushdown automata; tree data structures; AST fingerprint; PDA; Web attack kits; Web malware analyst; abstract syntax tree; code transformation; deobfuscation method; malicious JavaScript; malicious scripting content detection; obfuscated code classification; obfuscating transformation representation; obfuscation; pushdown automaton; subtree matching algorithm; Automata; Electronic mail; Feature extraction; Malware; Personal digital assistants; Proposals; Syntactics; JavaScript; abstract syntax tree; obfuscation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops (WAINA), 2012 26th International Conference on
  • Conference_Location
    Fukuoka
  • Print_ISBN
    978-1-4673-0867-0
  • Type

    conf

  • DOI
    10.1109/WAINA.2012.140
  • Filename
    6185286