Title :
Signature based malware detection for unstructured data in Hadoop
Author :
Sahoo, Abhaya Kumar ; Sahoo, Kshira Sagar ; Tiwary, Mayank
Author_Institution :
Dept. of Inf. Technol., C.V. Raman Coll. of Eng., Bhubaneswar, India
Abstract :
Hadoop is an efficient distributed processing framework. It is based on the map-reduce approach, in which an application is divided into small fragments of work, each of which may be executed on any node in the cluster. Hadoop is an efficient tool for storing and processing unstructured, semi-structured and structured data. Unstructured data usually refers to data stored in files rather than in the traditional row-and-column layout. Examples of unstructured data are e-mail messages, videos, audio files, photos, web pages, and many other kinds of business documents. Our work primarily focuses on detecting malware in unstructured data stored in the Hadoop distributed file system environment. Here we use ClamAV's updated free virus signature database. We also propose a fast string search algorithm based on the map-reduce approach.
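The signature scan described in the abstract can be illustrated with a minimal in-process sketch. It is not the authors' algorithm: it uses Python's plain substring search in place of the proposed fast string search, simulates the map and reduce steps locally instead of running on Hadoop, and the names `SIGNATURES`, `map_scan`, `reduce_hits` and `scan`, as well as the signature entries themselves, are hypothetical and are not real ClamAV records.

```python
from collections import defaultdict

# Hypothetical signatures (name -> byte pattern); NOT real ClamAV entries.
SIGNATURES = {
    "Demo.Sig.A": bytes.fromhex("deadbeef"),
    "Demo.Sig.B": b"EVIL_PAYLOAD",
}


def map_scan(filename, data):
    """Map step: emit (filename, signature_name) for each signature found."""
    for name, pattern in SIGNATURES.items():
        if pattern in data:  # naive substring search, assumed for illustration
            yield filename, name


def reduce_hits(pairs):
    """Reduce step: group the matched signature names by filename."""
    grouped = defaultdict(set)
    for filename, name in pairs:
        grouped[filename].add(name)
    return {f: sorted(names) for f, names in grouped.items()}


def scan(files):
    """Run the map step over every file, then reduce all emitted pairs."""
    pairs = []
    for filename, data in files.items():
        pairs.extend(map_scan(filename, data))
    return reduce_hits(pairs)


# Toy stand-ins for unstructured files stored in the cluster.
files = {
    "mail.eml": b"hello \xde\xad\xbe\xef world",
    "photo.jpg": b"\xff\xd8\xff plain image bytes",
    "doc.bin": b"xxEVIL_PAYLOADxx\xde\xad\xbe\xef",
}
report = scan(files)
```

In this sketch a clean file simply produces no key in `report`. On a real cluster a file may be split across blocks, so a signature that straddles a block boundary would need extra handling that this sketch does not attempt.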
Keywords :
computer viruses; digital signatures; distributed databases; parallel processing; search problems; string matching; Hadoop distributed file system environment; distributed processing framework; free virus signature database; map-reduce approach; semi-structured data; signature based malware detection; string search algorithm; unstructured data; Clustering algorithms; Computers; Distributed databases; File systems; Malware; Pattern matching; Cluster; Hadoop; Malwares; Map-reduce; Pattern Matching; Signatures;
Conference_Title :
Advances in Electronics, Computers and Communications (ICAECC), 2014 International Conference on
Conference_Location :
Bangalore
DOI :
10.1109/ICAECC.2014.7002394