DocumentCode
710802
Title
Topic modeling of SSH logs using latent dirichlet allocation for the application in cyber security
Author
Aswani, Krishna ; Cronin, Aidan ; Xirui Liu ; Heyuan Zhao
Author_Institution
Univ. of Virginia, Charlottesville, VA, USA
fYear
2015
fDate
24-24 April 2015
Firstpage
75
Lastpage
79
Abstract
Cyber intrusions are one of the main causes of fear across the internet and now, due to the substantial increase in network traffic, detection of each unauthorized access has become extremely difficult. Brute-force attacks are the most common form of malicious traffic. To prevent such attacks and detect them in real time many new techniques have been developed. The majority of these techniques monitor the sequential transfers between users/IPs and the network. However, though many networks are now monitoring their logs and can identify when brute-force attacks occur, they cannot provide more detailed information about the attack (such as where and how) without some form of direct visual inspection of the logs. In this paper, we explore a Latent Dirichlet Allocation as a form of topic modeling of IP addresses through SSH authentication logs with the final goal of automating classifications of users. Using textual topics or the “top words” associated with logs, we differentiate legitimate users and brute-attackers users according to their IP addresses and discuss the potential of topic modelling for identifying and further classification of cyber threats.
Keywords
IP networks; Internet; authorisation; computer network security; pattern classification; telecommunication traffic; IP addresses; Internet; SSH authentication logs; brute-force attacks; cyber intrusions; cyber security; cyber threats classification; direct visual inspection; latent dirichlet allocation; malicious traffic; network traffic; sequential transfers; textual topics; topic modeling; unauthorized access; Data models; Feature extraction; Force; Hidden Markov models; IP networks; Resource management; Servers; Brute-force attacks; LDA; SSH logs; Topic model;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems and Information Engineering Design Symposium (SIEDS), 2015
Conference_Location
Charlottesville, VA
Print_ISBN
978-1-4799-1831-7
Type
conf
DOI
10.1109/SIEDS.2015.7117015
Filename
7117015
Link To Document