Title :
Search Results Clustering without External Resources
Author :
Chris Staff;Joel Azzopardi;Colin Layfield;Daniel Mercieca
Author_Institution :
Fac. of Inf. &
Abstract :
Our unsupervised Search Results Clustering (SRC) system partitions into clusters the top-n results returned by a search engine. We present the results of experiments with our SRC system that performs incremental clustering on document titles and snippets only and does not use external resources, yet which outperforms the best performers to date on the SemEval-2013 Task 11 gold standard. We include Latent Semantic Analysis (LSA) as an optional step, using the snippets themselves as the background corpus. We demonstrate that better results are achieved by leaving the query terms out of the clustering process, and that currently, the version without LSA outperforms the version with LSA.
Keywords :
"Clustering algorithms","Conferences","Semantics","Indexes","Search engines","Standards","Gold"
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2015 26th International Workshop on
Print_ISBN :
978-1-4673-7581-8
Electronic_ISBN :
2378-3915
DOI :
10.1109/DEXA.2015.67