Title :
HCube: A Server-centric Data Center Structure for Similarity Search
Author :
da Silva Villaca, R. ; Pasquini, R. ; de Paula, L.B. ; Magalhaes, M.F.
Author_Institution :
Sch. of Electr. & Comput. Eng., UNICAMP, Campinas, Brazil
Abstract :
The information society is facing a sharp increase in the amount of information, driven by the plethora of new applications that sprout all the time. The amount of data now circulating on the Internet is on the order of zettabytes (ZB), resulting in a scenario defined in the literature as Big Data. To handle such a challenging scenario, deployed solutions rely not only on the massive storage, memory and processing capacity installed in Data Centers (DC) maintained by big players all over the globe, but also on shrewd computational techniques, such as BigTable, MapReduce and Dynamo. In this context, this work presents a DC structure designed to support similarity search. The proposed solution aims to concentrate similar data on servers that are physically close within a DC, accelerating the retrieval of all data related to searches performed with the primitive get(k, sim), in which k is the query identifier, i.e., the data used as reference, and sim is a similarity level.
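As an illustration only, the semantics of the get(k, sim) primitive can be sketched as follows. The sketch assumes that data identifiers are fixed-length bit strings and that similarity is the normalized Hamming similarity (the keywords mention Hamming similarity, but the abstract does not define the measure). The names `hamming_similarity`, `get`, `store` and `bits` are hypothetical, and the linear scan over an in-memory dictionary stands in for the HCube server placement and routing, which are not described here.

```python
def hamming_similarity(a: int, b: int, bits: int) -> float:
    """Assumed measure: fraction of the `bits` positions where a and b agree."""
    distance = bin(a ^ b).count("1")  # Hamming distance between the identifiers
    return 1.0 - distance / bits


def get(store: dict, k: int, sim: float, bits: int) -> dict:
    """Return every stored item whose identifier is at least `sim` similar to k.

    Hypothetical stand-in for get(k, sim): a plain scan over a dict,
    not the data center structure proposed in the paper.
    """
    return {
        key: value
        for key, value in store.items()
        if hamming_similarity(k, key, bits) >= sim
    }


# Toy 8-bit identifiers, chosen only for this example.
store = {
    0b10110010: "reference item",
    0b10110011: "differs in 1 bit",
    0b10001010: "differs in 3 bits",
    0b01001101: "differs in all 8 bits",
}
result = get(store, k=0b10110010, sim=0.8, bits=8)
```

With these toy values, only the reference item and the identifier that differs in one bit reach the 0.8 threshold. A structure that keeps such identifiers on nearby servers would avoid scanning the whole store.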
Keywords :
Internet; computer centres; query processing; BigTable; HCube; MapReduce; ZB big data; dynamo; information society; query identifier; server-centric data center structure; similarity search; zettabytes; Big data; Hamming distance; Organizations; Reflective binary codes; Routing; Servers; Vectors; Big Data; Data Center; Hamming similarity; Similarity Search;
Conference_Title :
Advanced Information Networking and Applications (AINA), 2013 IEEE 27th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4673-5550-6
ISSN :
1550-445X
DOI :
10.1109/AINA.2013.139