Title :
Data Independent Method of Constructing Distributed LSH for Large-Scale Dynamic High-Dimensional Indexing
Author :
Gu, Xiaoguang ; Zhang, Lei ; Zhang, Dongming ; Zhang, Yongdong ; Li, Jintao ; Bao, Ning
Author_Institution :
Inst. of Comput. Technol., Beijing, China
Abstract :
Constructing effective and efficient indexes for explosively growing multimedia data is a very challenging problem. To address it, Haghani et al. proposed a distributed similarity search method for high-dimensional data using Locality Sensitive Hashing (LSH). However, their method must estimate a global parameter over the whole dataset beforehand, which is impractical for a large-scale dynamic dataset. This paper proposes a novel method of constructing distributed LSH that does not need any a priori knowledge about the dataset. By generating hash functions with a consistent output distribution, we obtain, in theory, a data-independent prediction model that can guarantee good load balance even when the dataset changes dynamically. Furthermore, we modify the query algorithm of basic LSH to make the proposed model more practical. Experimental results on two open large-scale high-dimensional datasets show that the proposed method is more robust, scalable, and practical than the state of the art.
Keywords :
file organisation; indexing; multimedia computing; query processing; resource allocation; search problems; data independent predicting model; distributed LSH construction; distributed similarity search method; global parameter estimation; hash function; indexing; load balance; locality sensitive hashing; multimedia data; query algorithm; Data models; Distributed databases; Gaussian distribution; Indexes; Load modeling; Peer to peer computing; Predictive models; Data Independent; Distributed Similarity Search; Locality Sensitive Hashing; Peer-to-Peer;
Conference_Title :
2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS)
Conference_Location :
Liverpool
Print_ISBN :
978-1-4673-2164-8
DOI :
10.1109/HPCC.2012.82