Title :
Towards optimal clustering for approximate similarity searching
Author :
Tuncel, Ertem ; Rose, Kenneth
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
Abstract :
We propose an iterative optimization algorithm for the generic class of clustering-based indexing for approximate similarity searching. It was previously shown that clustering is a powerful component of approximate searching that reduces the number of retrieved data points. The objective of the proposed algorithm is to maximize the expected search quality given the query distribution. The problem is decomposed into minimization over three mapping functions, and fixed-point iterations of the algorithm alternately optimizing one mapping while fixing the other two. We demonstrate via experiments on real high dimensional data sets that the algorithm significantly improves the time/accuracy efficiency over heuristic clustering design.
Keywords :
indexing; information retrieval; iterative methods; minimisation; multimedia databases; search problems; approximate similarity searching; high dimensional data sets; iterative optimization algorithm; minimization; multimedia databases; optimal clustering; Algorithm design and analysis; Clustering algorithms; Costs; Data mining; Feature extraction; Indexing; Information retrieval; Iterative algorithms; Search engines; Vectors;
Conference_Titel :
Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7803-7304-9
DOI :
10.1109/ICME.2002.1035655