• DocumentCode
    1791658
  • Title

    Researching persons & organizations: AWAKE: From text to an entity-centric knowledge base

  • Author

    Boschee, Elizabeth ; Freedman, Marjorie ; Khanwalkar, Saurabh ; Kumar, Ajit ; Srivastava, Anurag ; Weischedel, Ralph

  • Author_Institution
    Raytheon BBN Technol. Corp., Cambridge, MA, USA
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    1030
  • Lastpage
    1039
  • Abstract
    We describe a pilot experiment building a capability to automatically read documents, develop a knowledge base, support analytics, and visualize the information found. The capability allows someone researching a topic of interest of focus on analysis and synthesis rather than on reading. We show how information from multiple modalities (speech, text, structured databases) and multiple approaches (ontology driven and open information extraction) can be fused to create a resource about both previously known and novel entities. We describe an extensible framework for language understanding tools that allows for scalability, plug-and-play of alternative components, and incorporation of additional input streams, including video, images, and foreign language text.
  • Keywords
    data mining; document handling; AWAKE; documents; entity-centric knowledge base; foreign language text; information extraction; information visualisation; language understanding tools; multiple modalities; plug-and-play; scalability; Algorithm design and analysis; Computer architecture; Databases; Knowledge based systems; Ontologies; Organizations; Pipelines; automatic knowledge base construction. information extraction; entity disambiguation; entity discovery;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2014 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Type

    conf

  • DOI
    10.1109/BigData.2014.7004337
  • Filename
    7004337