• DocumentCode
    1141995
  • Title

    Web People Search via Connection Analysis

  • Author

    Kalashnikov, Dmitri V. ; Chen, Zhaoqi Stella ; Mehrotra, Sharad ; Nuray-Turan, Rabia

  • Author_Institution
    Dept. of Comput. Sci., California Univ., Irvine, CA
  • Volume
    20
  • Issue
    11
  • fYear
    2008
  • Firstpage
    1550
  • Lastpage
    1565
  • Abstract
    Nowadays, searches for Webpages of a person with a given name constitute a notable fraction of queries to web search engines. Such a query would normally return Webpages related to several namesakes, who happened to have the queried name, leaving the burden of disambiguating and collecting pages relevant to a particular person (from among the namesakes) on the user. In this article we develop a Web People Search approach that clusters Webpages based on their association to different people. Our method exploits a variety of semantic information extracted from Web pages, such as named entities and hyperlinks, to disambiguate among namesakes referred to on the Web pages. We demonstrate the effectiveness of our approach by testing the efficacy of the disambiguation algorithms and its impact on person search.
  • Keywords
    Internet; search engines; Web people search; Webpage; connection analysis; search engine; semantic information extraction; Clustering algorithms; Data mining; Internet; Machine learning; Psychology; Search engines; Social network services; Testing; Web pages; Web search; Clustering; Database Management; Information Storage and Retrieval; Internet search; Search process; Web Search; Web mining; and association rules; classification;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.78
  • Filename
    4497193