Title :
A method of Web search result clustering based on rough sets
Author :
Ngo, Chi Lang ; Nguyen, Hung Son
Author_Institution :
Inst. of Math., Warsaw Univ., Poland
Abstract :
Due to the enormous size of the Web and low precision of user queries, finding the right information from the Web can be difficult if not impossible. One approach that tries to solve this problem is using clustering techniques for grouping similar document together in order to facilitate presentation of results in more compact form and enable thematic browsing of the results set. The main problem of many Web search result (snippet) clustering algorithm is based on the poor vector representation of snippets. In this paper, we present a method of snippet representation enrichment using tolerance rough set model. We applied the proposed method to construct a rough set based search result clustering algorithm and compared it with other recent methods.
Keywords :
Internet; document handling; information retrieval; pattern clustering; rough set theory; Web search; document clustering; snippet representation enrichment; tolerance rough set model; vector representation; Algorithm design and analysis; Clustering algorithms; Mathematics; Navigation; Rough sets; Scalability; Search engines; Set theory; Web pages; Web search; clustering; rough sets; snippet;
Conference_Titel :
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN :
0-7695-2415-X