Title :
An Incremental Algorithm for Clustering Search Results
Author :
Liu, Yongli ; Ouyang, Yuanxin ; Sheng, Hao ; Xiong, Zhang
Author_Institution :
Sch. of Comput. Sci. & Technol., Beihang Univ., Beijing
fDate :
Nov. 30 2008-Dec. 3 2008
Abstract :
When Internet users are facing massive search results, document clustering techniques are very helpful. Generally, existing clustering methods start with a known set of data objects, measured against a known set of attributes. However, there are numerous applications where the attribute set can only obtained gradually as processing data objects incrementally. This paper presents an incremental clustering algorithm (ICA) for clustering search results, which relies on pair-wise search result similarity calculated using Jaccard method. We use a measure namely, cluster average similarity area to score cluster cohesiveness. Experimental results show that our algorithm leads to less computational time than traditional clustering method while achieving a comparable or better clustering quality.
Keywords :
Internet; pattern clustering; search engines; Internet users; Jaccard method; clustering search results; document clustering; incremental algorithm; incremental clustering algorithm; pair-wise search result; Area measurement; Clustering algorithms; Clustering methods; Computer science; Histograms; Independent component analysis; Internet; Search engines; User-generated content; Web search; incremental clustering; internet; search results;
Conference_Titel :
Signal Image Technology and Internet Based Systems, 2008. SITIS '08. IEEE International Conference on
Conference_Location :
Bali
Print_ISBN :
978-0-7695-3493-0
DOI :
10.1109/SITIS.2008.53