• DocumentCode
    1416175
  • Title

    Digging Deeper into Text Mining: Academics and Agencies Look Toward Unstructured Data

  • Author

    Goth, Greg

  • Volume
    16
  • Issue
    1
  • fYear
    2012
  • Firstpage
    7
  • Lastpage
    9
  • Abstract
    In an effort to help government officials anticipate significant events such as political unrest, disease outbreaks, or natural disasters, the US government´s Intelligence Advanced Research Projects Agency is launching a mass dataset mining effort, hoping to develop technologies that can mine disparate sources such as blogs, search engine results, Internet traffic, webcams, and many others. Researchers in the natural and social sciences have long been doing similar work, however, which might serve to show the current limitations of computational linguistics, especially in trying to discern, on the fly, events that could have significant policy implications.
  • Keywords
    Internet; Web sites; computational linguistics; data mining; public administration; public domain software; research and development; text analysis; Internet traffic; US government; Web search queries; Wikipedia edits; academics; agencies; blogs; computational linguistics; disease outbreaks; financial markets; intelligence advanced research projects agency; mass dataset mining effort; mass public dataset analysis; microblogs; natural disasters; natural sciences; open source indicators program; political unrest; social sciences; text mining; traffic webcams; unstructured data; Analytical models; Broadband communication; Data mining; Internet; Open systems; Text mining; Twitter; IARPA; computational linguistics; data mining; large datasets; public policy;
  • fLanguage
    English
  • Journal_Title
    Internet Computing, IEEE
  • Publisher
    ieee
  • ISSN
    1089-7801
  • Type

    jour

  • DOI
    10.1109/MIC.2012.6
  • Filename
    6123697