DocumentCode
3390999
Title
Discovering topics from dark websites
Author
Yang, Li ; Liu, Feiqiong ; Kizza, Joseph M. ; Ege, Raimund K.
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Tennessee at Chattanooga, Chattanooga, TN
fYear
2009
fDate
March 30 2009-April 2 2009
Firstpage
175
Lastpage
179
Abstract
Analysis of dark Websites is important for developing effective combating strategies against terrorism or extremists when more and more scattered terrorist cells use the ubiquity of the Internet to form communities in virtual space with fairly low costs. Terrorists or extremists anonymously set up various Web sites embedded in the public Internet, exchanging ideology, spreading propaganda, and recruiting new members. In this paper, we propose a framework to discover latent topics via analyzing contents of dark Websites. The content and data from dark Websites are gathered and extracted by crawlers and exported to documents. Latent Dirichlet allocation (LDA) algorithm is used to analyze the extracted documents so as to discover latent topics from web sites of terrorists or extremists. In contrast to the traditional information retrieval (IR) schemes, LDA-based analysis assigns a probability to a document and captures exchangeability of both words and documents. Our work helps to gain insights into the structure and communities of terrorists and extremists.
Keywords
Internet; security of data; Internet; dark Websites; information retrieval schemes; latent Dirichlet allocation; Algorithm design and analysis; Costs; Crawlers; Data mining; Information retrieval; Internet; Linear discriminant analysis; Recruitment; Scattering; Terrorism;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence in Cyber Security, 2009. CICS '09. IEEE Symposium on
Conference_Location
Nashville, TN
Print_ISBN
978-1-4244-2769-7
Type
conf
DOI
10.1109/CICYBS.2009.4925106
Filename
4925106
Link To Document