• DocumentCode
    3165844
  • Title

    Language-Independent Set Expansion of Named Entities Using the Web

  • Author

    Wang, Richard C. ; Cohen, William W.

  • Author_Institution
    Carnegie Mellon Univ., Pittsburgh
  • fYear
    2007
  • fDate
    28-31 Oct. 2007
  • Firstpage
    342
  • Lastpage
    350
  • Abstract
    Set expansion refers to expanding a given partial set of objects into a more complete set. A well-known example system that does set expansion using the web is Google Sets. In this paper, we propose a novel method for expanding sets of named entities. The approach can be applied to semi-structured documents written in any markup language and in any human language. We present experimental results on 36 benchmark sets in three languages, showing that our system is superior to Google Sets in terms of mean average precision.
  • Keywords
    Internet; document handling; hypermedia markup languages; natural language processing; search engines; Google sets; World Wide Web; human language; language-independent set expansion; markup language; named entity; semistructured documents; Cancer; Collaborative work; Data mining; Humans; Information filtering; Information filters; Markup languages; Seals; USA Councils; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
  • Conference_Location
    Omaha, NE
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3018-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2007.104
  • Filename
    4470258