Title :
Concept-based Web Search using Domain Prediction and Parallel Query Expansion
Author :
Joshi, Rahul R. ; Aslandogan, Y. Alp
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Texas, Arlington, TX
Abstract :
We address the problem of irrelevant results for short queries on Web search engines by combining query expansion with latent semantic indexing in the WordSpace model. First, we predict the potential concept topics, that is, the domains to which the search terms may belong. Next, we expand the search terms in each of the predicted domains in parallel. We then submit separate queries, each specialized for one domain, to a general-purpose search engine. The user is presented with search results categorized under the predicted domains. To build the word association models, we prepared a categorized text collection (corpus) from Web directory listings. We compare the results obtained using this corpus with those obtained using the Reuters corpus. User evaluations indicate that our approach helps users avoid examining irrelevant Web search results, especially for short queries.
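The pipeline described in the abstract (predict domains, expand the query per domain in parallel, emit one specialized query per domain) can be illustrated with a minimal Python sketch. This is not the authors' implementation: raw word co-occurrence counts stand in for latent semantic indexing in the WordSpace model, the tiny `CORPUS` is invented for illustration, and the names `build_associations`, `predict_domains`, `expand_query`, and `expand_in_parallel` are hypothetical. The sketch stops before the step of submitting queries to a search engine.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy corpus (domain -> documents), invented for illustration.
# The paper uses a Web directory corpus and the Reuters corpus instead.
CORPUS = {
    "biology": ["cell membrane protein nucleus", "cell division nucleus dna"],
    "telecom": ["cell phone network tower", "mobile phone cell signal"],
    "cooking": ["bake bread oven flour"],
}


def build_associations(docs):
    """Count, per word, how often every other word shares a document with it.

    Assumption: plain co-occurrence counts replace the LSI/WordSpace model.
    """
    assoc = {}
    for doc in docs:
        words = set(doc.split())
        for word in words:
            assoc.setdefault(word, Counter()).update(words - {word})
    return assoc


# One word association model per domain.
MODELS = {domain: build_associations(docs) for domain, docs in CORPUS.items()}


def predict_domains(query):
    """Return, in alphabetical order, the domains that know any query term."""
    terms = query.split()
    return sorted(d for d, model in MODELS.items() if any(t in model for t in terms))


def expand_query(query, domain, k=2):
    """Append the k words most strongly associated with the query in a domain."""
    terms = query.split()
    scores = Counter()
    for term in terms:
        scores.update(MODELS[domain].get(term, Counter()))
    for term in terms:
        scores.pop(term, None)  # never re-add a word already in the query
    # Highest count first; ties broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return " ".join(terms + [word for word, _ in ranked[:k]])


def expand_in_parallel(query):
    """Expand the query in every predicted domain concurrently."""
    domains = predict_domains(query)
    with ThreadPoolExecutor() as pool:
        expanded = list(pool.map(lambda d: expand_query(query, d), domains))
    return dict(zip(domains, expanded))


if __name__ == "__main__":
    for domain, specialized in expand_in_parallel("cell").items():
        print(f"{domain}: {specialized}")
```

With this toy data, the ambiguous one-word query `cell` yields one specialized query per predicted domain, which is the property the abstract relies on to group results by domain.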
Keywords :
Internet; indexing; query processing; search engines; Reuters corpus; Web directory; Web search engines; WordSpace model; concept-based Web search; domain prediction; latent semantic indexing; parallel query expansion; word association model; Biology; Cells (biology); Circuits; Computer science; Humans; Indexing; Java; Predictive models; Search engines; Web search
Conference_Title :
Information Reuse and Integration, 2006 IEEE International Conference on
Conference_Location :
Waikoloa Village, HI
Print_ISBN :
0-7803-9788-6
DOI :
10.1109/IRI.2006.252407