DocumentCode :
1724229
Title :
Hamming DHT: Taming the similarity search
Author :
da Silva Villaca, R. ; de Paula, L.B. ; Pasquini, R. ; Magalhaes, M.F.
Author_Institution :
Sch. of Electr. & Comput. Eng., UNICAMP, Campinas, Brazil
fYear :
2013
Firstpage :
7
Lastpage :
12
Abstract :
The semantic meaning of a content is frequently represented by content vectors in which each dimension represents an attribute of this content, such as, keywords in a text, colors in a picture or profile information in a social network. However, one important challenge in this semantic context is the storage and retrieval of similar contents, such as the search for similar images assisting a medical procedure. Based on it, this paper presents a new Distributed Hash Table (DHT), called Hamming DHT, in which Locality Sensitive Hashing (LSH) functions, specially the Random Hyperplane Hashing (RHH), are used to generate content identifiers, propitiating a scenario in which similar contents are stored in peers nearly located in the indexing space of the proposed DHT. The evaluations of this work simulate profiles in a social network to verify if the proposed DHT is capable of reducing the number of hops required in order to improve the recall in the context of a similarity search.
Keywords :
Hamming codes; cryptography; social networking (online); Hamming DHT; LSH functions; RHH; distributed hash table; locality sensitive hashing function; random hyperplane hashing; semantic content vectors; similarity search; social network; Hamming distance; Organizations; Peer-to-peer computing; Proposals; Reflective binary codes; Social network services; Vectors; Cosine distance; DHT; Gray codes; Hamming distance; LSH; RHH; Similarity Search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Communications and Networking Conference (CCNC), 2013 IEEE
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-3131-9
Type :
conf
DOI :
10.1109/CCNC.2013.6488418
Filename :
6488418
Link To Document :
بازگشت