DocumentCode
2571695
Title
Cache-aware load balancing vs. cooperative caching for distributed search engines
Author
Dominguez-Sal, David ; Perez-Casany, Marta ; Larriba-Pey, Josep Lluis
Author_Institution
Comput. Archit. Dept., DAMA-UPC, Barcelona, Spain
fYear
2009
fDate
25-27 June 2009
Firstpage
415
Lastpage
423
Abstract
In this paper we study the performance of a distributed search engine from a data caching point of view. We compare and combine two different approaches to achieve better hit rates: (a) send the queries to the node which currently has the related data in its local memory (cache-aware load balancing), and (b) send the cached contents to the node where a query is being currently processed (cooperative caching). Furthermore, we study the best scheduling points in the query computation in which they can be reassigned to another node, and how this reassignation should be performed. Our analysis is guided by statistical tools on a real question answering system for several query distributions, which are typically found in query logs.
Keywords
cache storage; distributed processing; query processing; resource allocation; scheduling; search engines; cache-aware load balancing; cooperative data caching; distributed search engine; distributed system; query log; query processing; question answering system; scheduling scheme; statistical tool; Cities and towns; Computational efficiency; Computer architecture; Cooperative caching; High performance computing; Load management; Mathematics; Natural languages; Processor scheduling; Search engines; Question Answering; cooperative caching; distributed systems; load balancing;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing and Communications, 2009. HPCC '09. 11th IEEE International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4244-4600-1
Electronic_ISBN
978-0-7695-3738-2
Type
conf
DOI
10.1109/HPCC.2009.31
Filename
5167022
Link To Document