DocumentCode
2292289
Title
Web Mining for Open Source Intelligence
Author
Best, Clive
Author_Institution
Joint Res. Centre, OSVision Ltd., London
fYear
2008
fDate
9-11 July 2008
Firstpage
321
Lastpage
325
Abstract
Web mining for open source intelligence is the retrieval, extraction and analysis of information from online Internet sites. This paper reviews two separate application areas, namely live news monitoring and targeted topic-based data mining. Most newspapers and news agencies have Web sites with live updates on unfolding events, together with opinions and perspectives on world events. Most governments monitor news reports to gauge public opinion and to obtain early warning of emerging crises. The Joint Research Centre has developed significant experience in Internet content monitoring through its work on the Europe Media Monitor (EMM) for the European Commission. EMM forms the core of the Commission's daily press monitoring service. Intelligence services and law enforcement agencies also require monitoring of specific sites and topics, and EMM technology has been applied to the wider Internet for this purpose. The software extracts and downloads all the textual content from monitored sites and applies information extraction techniques. These tools help analysts process large numbers of documents to derive structured data. Lastly, visualisation of the extracted data is important for analysts to identify patterns and trends derived from both news reports and Web mining.
Keywords
Internet; Web sites; content management; data mining; data visualisation; information retrieval; Internet content monitoring; Web mining; Web site; information analysis; information extraction; intelligence service; live news-monitoring; media monitoring; online Internet site; open source intelligence; Data visualization; Government; Law enforcement; Monitoring; Pattern analysis; Web and internet services; Multilinguality; Visualisation
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Visualisation, 2008. IV '08. 12th International Conference
Conference_Location
London
ISSN
1550-6037
Print_ISBN
978-0-7695-3268-4
Type
conf
DOI
10.1109/IV.2008.86
Filename
4577966