• DocumentCode
    2897812
  • Title

    Malware Variant Detection Using Similarity Search over Sets of Control Flow Graphs

  • Author

    Cesare, Silvio ; Xiang, Yang

  • Author_Institution
    Sch. of Inf. Technol., Deakin Univ., Burwoord, VIC, Australia
  • fYear
    2011
  • fDate
    16-18 Nov. 2011
  • Firstpage
    181
  • Lastpage
    189
  • Abstract
    Static detection of polymorphic malware variants plays an important role to improve system security. Control flow has shown to be an effective characteristic that represents polymorphic malware instances. In our research, we propose a similarity search of malware using novel distance metrics of malware signatures. We describe a malware signature by the set of control flow graphs the malware contains. We propose two approaches and use the first to perform pre-filtering. Firstly, we use a distance metric based on the distance between feature vectors. The feature vector is a decomposition of the set of graphs into either fixed size k-subgraphs, or q-gram strings of the high-level source after decompilation. We also propose a more effective but less computationally efficient distance metric based on the minimum matching distance. The minimum matching distance uses the string edit distances between programs´ decompiled flow graphs, and the linear sum assignment problem to construct a minimum sum weight matching between two sets of graphs. We implement the distance metrics in a complete malware variant detection system. The evaluation shows that our approach is highly effective in terms of a limited false positive rate and our system detects more malware variants when compared to the detection rates of other algorithms.
  • Keywords
    flow graphs; invasive software; control flow graphs; distance metric; malware signatures; malware variant detection; polymorphic malware; similarity search; static detection; Feature extraction; Flow graphs; Malware; Measurement; Software; Support vector machine classification; Vectors; computer security; control flow; decompilation; malware classification; static analysi; structuring;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Trust, Security and Privacy in Computing and Communications (TrustCom), 2011 IEEE 10th International Conference on
  • Conference_Location
    Changsha
  • Print_ISBN
    978-1-4577-2135-9
  • Type

    conf

  • DOI
    10.1109/TrustCom.2011.26
  • Filename
    6120818