DocumentCode :
1961159
Title :
Efficiently supporting multiple similarity queries for mining in metric databases
Author :
Braunmüller, Bernhard ; Ester, Martin ; Kriegel, Hans-Peter ; Sander, Jörg
Author_Institution :
Munchen Univ., Germany
fYear :
2000
fDate :
2000
Firstpage :
256
Lastpage :
267
Abstract :
Metric databases are databases where a metric distance function is defined for pairs of database objects. In such databases, similarity queries in the form of range queries or k-nearest neighbor queries are the most important queries. In traditional query processing, single queries are issued independently by different users. In many data mining applications, however, the database is typically explored by iteratively asking similarity queries for answers of previous similarity queries. In this paper, we introduce a generic scheme for such data mining algorithms and we investigate two orthogonal approaches, reducing I/O cost as well as CPU cost, to speed-up the processing of multiple similarity queries. The proposed techniques apply to any type of similarity query and to an implementation based on an index or using a sequential scan. Parallelization yields an additional impressive speed-up. An extensive performance evaluation confirms the efficiency of our approach
Keywords :
data mining; performance evaluation; query processing; visual databases; data mining; generic scheme; k-nearest neighbor queries; metric databases; multiple similarity queries; performance evaluation; range queries; Data mining; Electrical capacitance tomography; Euclidean distance; Histograms; Image databases; Nearest neighbor searches; Query processing; Spatial databases; Spatial indexes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2000. Proceedings. 16th International Conference on
Conference_Location :
San Diego, CA
ISSN :
1063-6382
Print_ISBN :
0-7695-0506-6
Type :
conf
DOI :
10.1109/ICDE.2000.839418
Filename :
839418
Link To Document :
بازگشت