DocumentCode :
2365564
Title :
Detecting Unknown Malicious Executables Using Portable Executable Headers
Author :
Wang, Tzu-Yen ; Wu, Chin-Hsiung ; Hsieh, Chu-Cheng
Author_Institution :
Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear :
2009
fDate :
25-27 Aug. 2009
Firstpage :
278
Lastpage :
284
Abstract :
Even though numerous kinds of anti-virus software packages have been used for many years, previously unseen malware is still a serious threat to computer and information system. By analyzing portable executable header entries of executables, a malware detection model which consists of four stages: attribute extraction, attribute binarization, attribute elimination, and feature selection and classifier training was carried out in this study. First, we collected header entries from all executables in our dataset and viewed each entry as a potential attribute. Second, information gain and gain ratio were used to binarize numerical and nominal attributes. Next, useless and redundant attributes were eliminated in the third stage. Finally, by using support vector machine which is a classification algorithm of conspicuous generalization ability, feature selection was simultaneously performed with classifier training to reduce the number of attributes and retain the performance of classifier in a cost-effective. We evaluated our model by 1,908 benign programs and 7,863 malicious files (virus, email worm, trojan and backdoor) and estimated its generalization ability by cross validation. The experiment results showed that our model had promising performance for detecting virus and email worm.
Keywords :
feature extraction; invasive software; pattern classification; support vector machines; anti-virus software packages; attribute binarization; attribute elimination; attribute extraction; binarize numerical; classification algorithm; classifier training; cross validation; email worm; feature selection; gain ratio; information gain; malware detection model; nominal attributes; portable executable headers; previously unseen malware; support vector machine; virus detection; Classification algorithms; Computer crashes; Computer worms; Data mining; Information systems; Performance gain; Portable computers; Software packages; Support vector machine classification; Support vector machines; feature selection; gain ratio; information gain; previously unseen malware; support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-5209-5
Electronic_ISBN :
978-0-7695-3769-6
Type :
conf
DOI :
10.1109/NCM.2009.385
Filename :
5331712
Link To Document :
بازگشت