DocumentCode
2424126
Title
Use link-based clustering to improve Web search results
Author
Wang, Yitong ; Kitsuregawa, Masaru
Author_Institution
Inst. of Ind. Sci., Tokyo Univ., Japan
Volume
1
fYear
2001
fDate
3-6 Dec. 2001
Firstpage
115
Abstract
While Web search engines can retrieve information on the Web for a specific topic, users have to step a long ordered list in order to locate the needed information, which is often tedious and less efficient. We propose a new link-based clustering approach to cluster search results returned from Web search engines by exploring both co-citation and coupling. Unlike document clustering algorithms in IR that are based on common words/phrases shared among documents, our approach is based on common links shared by pages. We also extend the standard clustering algorithm, K-means, to make it more natural to handle noise and apply it to Web search results. By filtering some irrelevant pages, our approach clusters high quality pages in Web search results into semantically meaningful groups to facilitate users accessing and browsing. Preliminary experiments and evaluations are conducted to investigate its effectiveness. The experimental results show that link-based clustering of Web search results is promising and beneficial.
Keywords
Internet; citation analysis; information resources; information retrieval; search engines; Internet; K-means; Web search engines; Web search results; clustering algorithm; co-citation; common links; coupling; document clustering; experiments; information retrieval; link-based clustering; Clustering algorithms; Data engineering; Data mining; Information filtering; Information filters; Information retrieval; Internet; Search engines; Web pages; Web search;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Information Systems Engineering, 2001. Proceedings of the Second International Conference on
Print_ISBN
0-7695-1393-X
Type
conf
DOI
10.1109/WISE.2001.996472
Filename
996472
Link To Document