DocumentCode
3165844
Title
Language-Independent Set Expansion of Named Entities Using the Web
Author
Wang, Richard C. ; Cohen, William W.
Author_Institution
Carnegie Mellon Univ., Pittsburgh
fYear
2007
fDate
28-31 Oct. 2007
Firstpage
342
Lastpage
350
Abstract
Set expansion refers to expanding a given partial set of objects into a more complete set. A well-known example system that does set expansion using the web is Google Sets. In this paper, we propose a novel method for expanding sets of named entities. The approach can be applied to semi-structured documents written in any markup language and in any human language. We present experimental results on 36 benchmark sets in three languages, showing that our system is superior to Google Sets in terms of mean average precision.
Keywords
Internet; document handling; hypermedia markup languages; natural language processing; search engines; Google Sets; World Wide Web; human language; language-independent set expansion; markup language; named entity; semistructured documents; Cancer; Collaborative work; Data mining; Humans; Information filtering; Information filters; Markup languages; Seals; USA Councils; Web pages
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location
Omaha, NE
ISSN
1550-4786
Print_ISBN
978-0-7695-3018-5
Type
conf
DOI
10.1109/ICDM.2007.104
Filename
4470258