Title :
Mining DNS for malicious domain registrations
Author :
He, Yuanchen ; Zhong, Zhenyu ; Krasser, Sven ; Tang, Yuchun
Author_Institution :
McAfee Inc., Alpharetta, GA, USA
Abstract :
Millions of new domains are registered every day, and many of them are malicious. Because of the large number of domains, it is challenging to keep track of malicious ones through Web content analysis alone. One interesting pattern is that many legitimate domain names consist of English words or look like meaningful English, while many malicious domain names are randomly generated and do not include meaningful words. We show that it is possible to transform this intuitive observation into statistically informative features using second-order Markov models. Four transition matrices are built: one from known legitimate domain names, one from known malicious domain names, one from English words in a dictionary, and one based on a uniform distribution. The probabilities from these Markov models, as well as other features extracted from DNS data, are used to build a Random Forest classifier. The experimental results demonstrate that our system can quickly catch malicious domains with a low false positive rate.
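The Markov-model features described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the alphabet, the `^`/`$` padding markers, the add-one smoothing, the per-character normalization, and the function names `train` and `log_prob` are all assumptions made for illustration. Each model stores counts of a character following a two-character context, which is what a second-order transition matrix encodes.

```python
import math
from collections import defaultdict

# Assumed alphabet for domain labels (letters, digits, hyphen).
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789-"
N_SYMBOLS = len(ALPHABET) + 1  # +1 for the assumed end marker "$"


def train(names):
    """Count second-order transitions: (two previous chars) -> next char."""
    counts = defaultdict(lambda: defaultdict(float))
    for name in names:
        padded = "^^" + name.lower() + "$"  # assumed start/end padding
        for i in range(2, len(padded)):
            counts[padded[i - 2:i]][padded[i]] += 1.0
    return counts


def log_prob(model, name, smoothing=1.0):
    """Average per-character log-probability of `name` under `model`.

    Additive smoothing (an assumption) keeps unseen transitions finite.
    An empty model therefore behaves like the uniform-distribution model.
    """
    padded = "^^" + name.lower() + "$"
    total = 0.0
    for i in range(2, len(padded)):
        context = model.get(padded[i - 2:i], {})
        numerator = context.get(padded[i], 0.0) + smoothing
        denominator = sum(context.values()) + smoothing * N_SYMBOLS
        total += math.log(numerator / denominator)
    return total / (len(padded) - 2)


# Hypothetical toy training sets; real ones would come from labeled data.
legit_model = train(["google", "facebook", "amazon", "wikipedia"])
malicious_model = train(["xkqzvj", "qwzxkp", "zzqkxv"])
dictionary_model = train(["book", "face", "river", "apple"])
uniform_model = {}

# One feature per model; these would sit alongside other DNS-derived
# features as input to a classifier such as a Random Forest.
features = [
    log_prob(m, "goggle")
    for m in (legit_model, malicious_model, dictionary_model, uniform_model)
]
```

Under these assumptions, a name that resembles the legitimate training names tends to score higher under `legit_model` than a random-looking string does, and that contrast is the signal the classifier can use.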
Keywords :
Internet; Markov processes; data mining; matrix algebra; pattern classification; security of data; DNS mining; Web content analysis; domain name system; malicious domain registrations; random forest classifier; second order Markov models; transition matrices; Decision trees; Helium; Machine learning; Manuals; Vegetation
Conference_Title :
Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom), 2010 6th International Conference on
Conference_Location :
Chicago, IL
Print_ISBN :
978-963-9995-24-6