DocumentCode
1141995
Title
Web People Search via Connection Analysis
Author
Kalashnikov, Dmitri V. ; Chen, Zhaoqi Stella ; Mehrotra, Sharad ; Nuray-Turan, Rabia
Author_Institution
Dept. of Comput. Sci., California Univ., Irvine, CA
Volume
20
Issue
11
fYear
2008
Firstpage
1550
Lastpage
1565
Abstract
Nowadays, searches for Web pages of a person with a given name constitute a notable fraction of queries to Web search engines. Such a query normally returns Web pages related to several namesakes who happen to share the queried name, leaving the user with the burden of disambiguating and collecting the pages relevant to a particular person from among the namesakes. In this article, we develop a Web People Search approach that clusters Web pages based on their association with different people. Our method exploits a variety of semantic information extracted from Web pages, such as named entities and hyperlinks, to disambiguate among the namesakes referred to on those pages. We demonstrate the effectiveness of our approach by testing the efficacy of the disambiguation algorithms and their impact on person search.
Keywords
Internet; search engines; Web people search; Webpage; connection analysis; search engine; semantic information extraction; Clustering algorithms; Data mining; Machine learning; Psychology; Social network services; Testing; Web pages; Web search; Clustering, classification, and association rules; Database Management; Information Storage and Retrieval; Internet search; Search process; Web mining
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
IEEE
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2008.78
Filename
4497193