DocumentCode :
1788984
Title :
Signature based malware detection for unstructured data in Hadoop
Author :
Sahoo, Abhaya Kumar ; Sahoo, Kshira Sagar ; Tiwary, Mayank
Author_Institution :
Dept. of Inf. Technol., C.V. Raman Coll. of Eng., Bhubaneswar, India
fYear :
2014
fDate :
10-11 Oct. 2014
Firstpage :
1
Lastpage :
6
Abstract :
Hadoop is a very efficient distributed processing framework. It is based on the map-reduce approach, in which an application is divided into small fragments of work, each of which may be executed on any node in the cluster. Hadoop is a very efficient tool for storing and processing unstructured, semi-structured and structured data. Unstructured data usually refers to data stored in files rather than in the traditional row-and-column form. Examples of unstructured data are e-mail messages, videos, audio files, photos, web pages, and many other kinds of business documents. Our work primarily focuses on detecting malware in unstructured data stored in a Hadoop distributed file system environment. Here we use ClamAV's updated free virus signature database. We also propose a fast string search algorithm based on the map-reduce approach.
Keywords :
computer viruses; digital signatures; distributed databases; parallel processing; search problems; string matching; Hadoop distributed file system environment; distributed processing framework; free virus signature database; map-reduce approach; semi-structured data; signature based malware detection; string search algorithm; unstructured data; Clustering algorithms; Computers; Distributed databases; File systems; Malware; Pattern matching; Cluster; Hadoop; Malwares; Map-reduce; Pattern Matching; Signatures;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Advances in Electronics, Computers and Communications (ICAECC), 2014 International Conference on
Conference_Location :
Bangalore
Type :
conf
DOI :
10.1109/ICAECC.2014.7002394
Filename :
7002394
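Illustration :
The abstract describes signature matching split into map-reduce fragments. The Python sketch below shows one way that idea could look on a single machine; it is not the authors' algorithm and does not use Hadoop or the real ClamAV database format. The signature names and byte patterns, the helper names (`split_with_overlap`, `map_chunk`, `reduce_matches`, `scan`) and the chunk size are all assumptions made for illustration. Chunks overlap by the longest pattern length minus one so that a signature straddling a chunk boundary is still found.

```python
from collections import defaultdict

# Hypothetical signatures: name -> byte pattern (NOT real ClamAV entries).
SIGNATURES = {
    "Demo.Sig.A": bytes.fromhex("deadbeef"),
    "Demo.Sig.B": b"EVIL_PAYLOAD",
}


def split_with_overlap(data, chunk_size, overlap):
    """Yield (offset, chunk) pairs; each chunk extends `overlap` bytes into the next."""
    for start in range(0, len(data), chunk_size):
        yield start, data[start:start + chunk_size + overlap]


def map_chunk(file_name, offset, chunk, signatures):
    """Map step: emit (signature_name, (file_name, absolute_offset)) for every match."""
    for name, pattern in signatures.items():
        pos = chunk.find(pattern)
        while pos != -1:
            yield name, (file_name, offset + pos)
            pos = chunk.find(pattern, pos + 1)


def reduce_matches(pairs):
    """Reduce step: group hits by signature; the set drops duplicates from overlapping chunks."""
    grouped = defaultdict(set)
    for name, hit in pairs:
        grouped[name].add(hit)
    return {name: sorted(hits) for name, hits in grouped.items()}


def scan(files, signatures=SIGNATURES, chunk_size=16):
    """Run the map step over every chunk of every file, then reduce the emitted pairs."""
    overlap = max(len(p) for p in signatures.values()) - 1
    pairs = []
    for file_name, data in files.items():
        for offset, chunk in split_with_overlap(data, chunk_size, overlap):
            pairs.extend(map_chunk(file_name, offset, chunk, signatures))
    return reduce_matches(pairs)
```

In this sketch each call to `map_chunk` is independent of the others, which is the property that would let a cluster run the map step on any node; the plain `find` loop stands in for whatever fast string search the paper proposes.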