• DocumentCode
    2753466
  • Title
    Disambiguation of People in Web Search Using a Knowledge Base
  • Author
    Vu, Quang Minh ; Masada, Tomonari ; Takasu, Atsuhiro ; Adachi, Jun
  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo
  • fYear
    2007
  • fDate
    5-9 March 2007
  • Firstpage
    185
  • Lastpage
    191
  • Abstract
    Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different people, an effective method is needed to measure document similarities and to find documents related to the same person. Some previous researchers have used the vector space model or have tried to extract common named entities for measuring similarities. We propose a new method that uses Web directories as a knowledge base to find shared contexts in document pairs and uses the measurement of shared contexts to determine similarities between document pairs. Experimental results show that our proposed method outperforms the vector space model method and the named entity recognition method.
  • Keywords
    Internet; document handling; query processing; Web directories; Web search; document similarity measurement; knowledge based system; people disambiguation; personal names; queries; vector space model; Character recognition; Data mining; Informatics; Information science; Search engines; Social network services; Web pages; World Wide Web
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Research, Innovation and Vision for the Future, 2007 IEEE International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    1-4244-0694-3
  • Type
    conf
  • DOI
    10.1109/RIVF.2007.369155
  • Filename
    4223072