DocumentCode
3123588
Title
An information retrieval approach for malware classification based on Windows API calls
Author
Cheng, Julia Yu-Chin ; Tzung-Shian Tsai ; Chu-Sing Yang
Author_Institution
Inst. of Comput. & Commun. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume
04
fYear
2013
fDate
14-17 July 2013
Firstpage
1678
Lastpage
1683
Abstract
Automated malware toolkits allow for easy generation of new malicious programs. These new executables carry similar malicious code and demonstrate similar malicious behavior on infected hosts. In order to speed up the efficiency of mal ware detection, discriminating a malware as known or a new species of malware has become a critical issue in the security industry. In this paper, we propose a new approach to precisely classify malicious executables by employing information retrieval theory. Dynamic analysis of a sample´s sequence of Windows API function calls produces corresponding parameters and values which is used as input to a standard TF-IDF weighting scheme to identify malware families by their behavior characteristics. Irrelevance reduction is developed to filter out non-relevant features and improve accuracy of malware classification. Finally, a similarity measure is used to determine the most similar malware family to the tested samples.
Keywords
application program interfaces; information retrieval; invasive software; pattern classification; Windows API calls; Windows API function; automated malware toolkits; dynamic analysis; infected hosts; information retrieval approach; malicious behavior; malicious code; malicious programs; malware classification; malware detection; security industry; Abstracts; Malware; Vectors; IDF; Information retrieval; Malware classification; Similarity measure; TF; Windows API calls;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2013 International Conference on
Conference_Location
Tianjin
Type
conf
DOI
10.1109/ICMLC.2013.6890868
Filename
6890868
Link To Document