Title :
Robust Disambiguation of Web-Based Personal Names
Author :
Chen, Ying ; Martin, James ; Palmer, Martha
Author_Institution :
Center for Comput. Language & Educ. Res., Univ. of Colorado at Boulder, Boulder, CO
Abstract :
Personal name ambiguity is common in fast-growing web resources. This paper explores robust features for web personal name disambiguation that is fully unsupervised and not limited to the given web corpus. The experiments show that the broad features not only improve performance but also increase the robustness of a disambiguation system.
Keywords :
information analysis; information resources; Web corpus; Web resource; Web-based personal names; disambiguation system; personal name ambiguity; robust disambiguation; Biographies; Citation analysis; Clustering algorithms; Data mining; Degradation; Feature extraction; Filters; Frequency; Robustness; Search engines; Personal Name disambiguation; information extraction
Conference_Title :
Semantic Computing, 2008 IEEE International Conference on
Conference_Location :
Santa Clara, CA
Print_ISBN :
978-0-7695-3279-0
Electronic_ISBN :
978-0-7695-3279-0
DOI :
10.1109/ICSC.2008.36