DocumentCode :
175345
Title :
ALPD: Active Learning Framework for Enhancing the Detection of Malicious PDF Files
Author :
Nissim, Nir ; Cohen, Asaf ; Moskovitch, Robert ; Shabtai, Asaf ; Edry, Mattan ; Bar-Ad, Oren ; Elovici, Yuval
Author_Institution :
Dept. of Inf. Syst. Eng., Ben-Gurion Univ. of the Negev, Beer-Sheva, Israel
fYear :
2014
fDate :
24-26 Sept. 2014
Firstpage :
91
Lastpage :
98
Abstract :
Email communication carrying malicious attachments or links is often used as an attack vector for initial penetration of the targeted organization. Existing defense solutions prevent executables from entering organizational networks via emails, therefore recent attacks tend to use non-executable files such as PDF. Machine learning algorithms have recently been applied for detecting malicious PDF files. These techniques, however, lack an essential element - they cannot be updated daily. In this study we present ALPD, a framework that is based on active learning methods that are specially designed to efficiently assist anti-virus vendors to focus their analytical efforts. This is done by identifying and acquiring new PDF files that are most likely malicious, as well as informative benign PDF documents. These files are used for retraining and enhancing the knowledge stores. Evaluation results show that in the final day of the experiment, Combination, one of our AL methods, outperformed all the others, enriching the anti-virus´s signature repository with almost seven times more new PDF malware while also improving the detection model´s performance on a daily basis.
Keywords :
computer viruses; digital signatures; learning (artificial intelligence); ALPD; PDF malware; active learning framework; antivirus signature repository; attack vector; email communication; emails; informative benign PDF documents; machine learning algorithms; malicious PDF file detection; malicious attachments; organizational networks; Electronic mail; Feature extraction; Malware; Organizations; Portable document format; Support vector machines; Training; Active Learning; Machine Learning; Malware; PDF;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligence and Security Informatics Conference (JISIC), 2014 IEEE Joint
Conference_Location :
The Hague
Print_ISBN :
978-1-4799-6363-8
Type :
conf
DOI :
10.1109/JISIC.2014.23
Filename :
6975559
Link To Document :
بازگشت