Title :
An efficient cost model for optimization of nearest neighbor search in low and medium dimensional spaces
Author :
Tao, Yufei ; Zhang, Jun ; Papadias, Dimitris ; Mamoulis, Nikos
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, China
Abstract :
Existing models for nearest neighbor search in multidimensional spaces are not appropriate for query optimization because they either lead to erroneous estimation or involve complex equations that are expensive to evaluate in real-time. This article proposes an alternative method that captures the performance of nearest neighbor queries using approximation. For uniform data, our model involves closed formulae that are very efficient to compute and accurate for up to 10 dimensions. Further, the proposed equations can be applied on nonuniform data with the aid of histograms. We demonstrate the effectiveness of the model by using it to solve several optimization problems related to nearest neighbor search.
Keywords :
approximation theory; information retrieval systems; query formulation; query processing; tree searching; closed formulae; cost model; histogram; information retrieval; information storage; multidimensional space; nearest neighbor search optimization; query approximation; query optimization; selection process; Computer science; Cost function; Equations; Histograms; Multidimensional systems; Multimedia databases; Nearest neighbor searches; Neural networks; Query processing; Time series analysis; 65; Index Terms- Information storage and retrieval; selection process.;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2004.48