DocumentCode :
2417366
Title :
Non-uniform partition strategies for indexing high-dimensional data with different distributions
Author :
Wang, Ben ; Gan, Qiang
Author_Institution :
Dept. of Comput. Sci., Essex Univ., Colchester, UK
fYear :
2003
fDate :
10-12 Dec. 2003
Firstpage :
13
Lastpage :
20
Abstract :
Efficient high-dimensional data indexing algorithms are crucial for image retrieval in large datasets. One of the state-of-the-art indexing methods is vector approximation file (VA-file), which indexes high-dimensional data by filtering feature vectors so that only a small fraction of them are visited in the search process. The VA-file uses a partition strategy that divides the data space on every dimension to make each partition equally full and assigns a same number of bits to each dimension. However, the strategy is not efficient to image datasets where the number of different vector components (granularity) in each dimension is largely diverse. The first two partition strategies are implemented in a practical way according to the description from the original VA-file method. The other two nonuniform partition strategies are proposed to resolve the problems of reduplicate coordinates and uniform bits assignment for each dimension, which assign more bits to represent dimensions with more vector components. Experimental results have shown that these strategies largely improve the performance of the VA-file for nonuniform datasets in terms of query time and filtering efficiency.
Keywords :
database indexing; image retrieval; very large databases; visual databases; dimension granularity; feature vector filtering efficiency; high-dimensional data indexing algorithm; image dataset; image retrieval; nonuniform partition strategy; query time; uniform bit assignment; vector approximation file; Clustering algorithms; Computer science; Degradation; Delay; Filtering; Gallium nitride; Image retrieval; Indexing; Information retrieval; Partitioning algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Software Engineering, 2003. Proceedings. Fifth International Symposium on
Conference_Location :
Taichung, Taiwan
Print_ISBN :
0-7695-2031-6
Type :
conf
DOI :
10.1109/MMSE.2003.1254417
Filename :
1254417
Link To Document :
بازگشت