• DocumentCode
    658349
  • Title

    OnPerDis: Ontology-Based Personal Name Disambiguation on the Web

  • Author

    Zhao Lu ; Zhixian Yan ; Liang He

  • Author_Institution
    Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
  • Volume
    1
  • fYear
    2013
  • fDate
    17-20 Nov. 2013
  • Firstpage
    185
  • Lastpage
    192
  • Abstract
    With the growth of web documents, the ambiguity of personal name becomes more common and brings poor performance of web search. Identifying a correct personal entity from the a piece of or the whole document is still a very challenging problem, especially for Chinese websites. In this paper, we propose a novel Ontology-based approach for Personal Name Disambiguation (named "OnPerDis"). This approach has two main steps: first, we construct person ontology (PO) with rich conceptual modeling as well as a large set of supporting instances, second, for a given personal name on the web, we create a temporary instance and extract features from the web documents, calculate the similarity between this temporary instance and the instances in the PO. The one with the highest similarity score is chosen as the appropriate personal name. Our extensive evaluations with two rich real-life datasets (CIPS-SIGHAN 2012 NERD and Chinese web documents) shows OnPerDis\´ efficacy on personal name disambiguation on the Web.
  • Keywords
    Web sites; document handling; feature extraction; information retrieval; natural language processing; ontologies (artificial intelligence); pattern matching; CIPS-SIGHAN 2012 NERD; Chinese Web documents; Chinese Web sites; OnPerDis; PO; Web search; conceptual modeling; feature extraction; ontology-based personal name disambiguation; person ontology; personal entity Identification; personal name ambiguity; real-life datasets; similarity score; temporary instance; Data mining; Educational institutions; Encyclopedias; Feature extraction; Ontologies; Sociology; Statistics; Conceptual modeling; Instance matching; Ontology population; Personal name disambiguation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4799-2902-3
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2013.28
  • Filename
    6690013