Title :
Bounded LSH for Similarity Search in Peer-to-Peer File Systems
Author :
Hua, Yu ; Bin Xiao ; Feng, Dan ; Yu, Bo
Author_Institution :
Huazhong Univ. of Sci. & Technol., Wuhan
Abstract :
Similarity search has been widely studied in peer-to-peer environments. In this paper, we propose the Bounded Locality Sensitive Hashing (Bounded LSH) method for similarity search in P2P file systems. Compared to the basic Locality Sensitive Hashing (LSH), Bounded LSH makes improvement on the space saving and quick query response in the similarity search, especially for high-dimensional data objects that exhibit non-uniform distribution property. We present simple and space-efficient Bounded-LSH to map non-uniform data space into load-balanced hash buckets that contain approximate number of objects. Load-balanced hash buckets in Bounded-LSH, in turn, require less number of hash tables while maintaining a high probability of returning the closest objects to requests. Our experiments based on synthetic and real-world datasets showed the feasibility, query and space efficiency of our proposed method.
Keywords :
file organisation; peer-to-peer computing; query processing; resource allocation; bounded LSH; high-dimensional data objects; load balanced hash buckets; locality sensitive hashing; non uniform distribution property; peer-to-peer file systems; quick query response; similarity search; Clustering algorithms; Data structures; File systems; Image sensors; Indexing; Nearest neighbor searches; Parallel processing; Partitioning algorithms; Peer to peer computing; Videos;
Conference_Titel :
Parallel Processing, 2008. ICPP '08. 37th International Conference on
Conference_Location :
Portland, OR
Print_ISBN :
978-0-7695-3374-2
Electronic_ISBN :
0190-3918
DOI :
10.1109/ICPP.2008.25