DocumentCode :
2585295
Title :
Cost-Aware Processing of Similarity Queries in Structured Overlays
Author :
Karnstedt, Marcel ; Sattler, Kai-Uwe ; Hauswirth, Manfred ; Schmidt, Roman
Author_Institution :
Technische Univ. Ilmenau
fYear :
2006
fDate :
6-8 Sept. 2006
Firstpage :
81
Lastpage :
89
Abstract :
Large-scale distributed data management with P2P systems requires the existence of similarity operators for queries as we cannot assume that all users agree on exactly the same schema and value representations and data quality problems due to spelling errors and typos. In this paper, we present an approach for efficient processing of similarity selections and joins in a structured overlay. We show that there are several possible strategies exploiting DHT features to a different extent (i.e., key organization, routing, multicasting) and thus the choice of the best operator implementation in a given situation (selectivity, data distribution, load) should be based on cost information allowing the system to estimate the computation and communication costs of query execution plans. Hence, we present a cost model for similarity operations on structured data in a DHT and demonstrate the efficiency of our proposal by experimental results from a large-scale PlanetLab deployment
Keywords :
data handling; database management systems; distributed processing; peer-to-peer computing; query processing; DHT feature; P2P system; PlanetLab deployment; communication cost information; cost model; cost-aware processing; distributed data management; distributed hash table; peer-to-peer system; query execution plan; similarity operation; similarity query; similarity selection; structured data; structured overlay; Centralized control; Communication system control; Control systems; Costs; Distributed computing; Information management; Large-scale systems; Proposals; Quality management; Routing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Peer-to-Peer Computing, 2006. P2P 2006. Sixth IEEE International Conference on
Conference_Location :
Cambridge
Print_ISBN :
0-7695-2679-9
Type :
conf
DOI :
10.1109/P2P.2006.12
Filename :
1698597
Link To Document :
بازگشت