DocumentCode
1416175
Title
Digging Deeper into Text Mining: Academics and Agencies Look Toward Unstructured Data
Author
Goth, Greg
Volume
16
Issue
1
fYear
2012
Firstpage
7
Lastpage
9
Abstract
In an effort to help government officials anticipate significant events such as political unrest, disease outbreaks, or natural disasters, the US government´s Intelligence Advanced Research Projects Agency is launching a mass dataset mining effort, hoping to develop technologies that can mine disparate sources such as blogs, search engine results, Internet traffic, webcams, and many others. Researchers in the natural and social sciences have long been doing similar work, however, which might serve to show the current limitations of computational linguistics, especially in trying to discern, on the fly, events that could have significant policy implications.
Keywords
Internet; Web sites; computational linguistics; data mining; public administration; public domain software; research and development; text analysis; Internet traffic; US government; Web search queries; Wikipedia edits; academics; agencies; blogs; computational linguistics; disease outbreaks; financial markets; intelligence advanced research projects agency; mass dataset mining effort; mass public dataset analysis; microblogs; natural disasters; natural sciences; open source indicators program; political unrest; social sciences; text mining; traffic webcams; unstructured data; Analytical models; Broadband communication; Data mining; Internet; Open systems; Text mining; Twitter; IARPA; computational linguistics; data mining; large datasets; public policy;
fLanguage
English
Journal_Title
Internet Computing, IEEE
Publisher
ieee
ISSN
1089-7801
Type
jour
DOI
10.1109/MIC.2012.6
Filename
6123697
Link To Document