Title :
Web search via hub synthesis
Author :
Achlioptas, Dimitris ; Fiat, Amos ; Karlin, Anna R. ; McSherry, Frank
Author_Institution :
Microsoft Res., Redmond, WA, USA
Abstract :
We present a model for web search that captures in a unified manner three critical components of the problem: how the link structure of the web is generated, how the content of a web document is generated, and how a human searcher generates a query. The key to this unification lies in capturing the correlations between these components in terms of proximity in a shared latent semantic space. Given such a combined model, the correct answer to a search query is well defined, and thus it becomes possible to evaluate web search algorithms rigorously. We present a new web search algorithm, based on spectral techniques, and prove that it is guaranteed to produce an approximately correct answer in our model. The algorithm assumes no knowledge of the model, and is well-defined regardless of the model's accuracy.
Keywords :
Internet; information resources; query processing; hub synthesis; search query; shared latent semantic space; spectral techniques; web document; web search; Books; Computer science; Couplings; Embedded computing; Humans; Information retrieval; Search engines; Web pages
Conference_Title :
Foundations of Computer Science, 2001. Proceedings. 42nd IEEE Symposium on
Print_ISBN :
0-7695-1116-3
DOI :
10.1109/SFCS.2001.959926