DocumentCode :
1824823
Title :
Spatio-temporal mining of software adoption & penetration
Author :
Papalexakis, Evangelos E. ; Dumitras, Tudor ; Duen Horng Chau ; Prakash, B. Aditya ; Faloutsos, Christos
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
878
Lastpage :
885
Abstract :
How does malware propagate? Does it form spikes over time? Does it resemble the propagation pattern of benign files, such as software patches? Does it spread uniformly over countries? How long does it take for a URL that distributes malware to be detected and shut down? In this work, we answer these questions by analyzing patterns from 22 million malicious (and benign) files, found on 1.6 million hosts worldwide during the month of June 2011. We conduct this study using the WINE database available at Symantec Research Labs. Additionally, we explore the research questions raised by sampling on such large databases of executables; the importance of studying the implications of sampling is twofold: First, sampling is a means of reducing the size of the database hence making it more accessible to researchers; second, because every such data collection can be perceived as a sample of the real world. Finally, we discover the SHARKFIN temporal propagation pattern of executable files, the GEOSPLIT pattern in the geographical spread of machines that report executables to Symantec´s servers, the Periodic Power Law (PPL) distribution of the life-time of URLs, and we show how to efficiently extrapolate crucial properties of the data from a small sample. To the best of our knowledge, our work represents the largest study of propagation patterns of executables.
Keywords :
data mining; invasive software; GEOSPLIT pattern; PPL distribution; SHARKFIN temporal propagation pattern; Symantec servers; URL; WINE database; benign files; database size reduction; executable propagation patterns; malware; pattern analysis; periodic power law distribution; software adoption; software patches; software penetration; spatio-temporal mining; Data models; Databases; Extrapolation; Internet; Malware; Software; Data Analysis; Internet Security; Malware Propagation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on
Conference_Location :
Niagara Falls, ON
Type :
conf
Filename :
6785804
Link To Document :
بازگشت