Title :
Distributed high-dimensional index creation using Hadoop, HDFS and C++
Author :
Gylfi Þór Gudmundsson;Laurent Amsaleg;Björn Þór Jónsson
Author_Institution :
INRIA, Rennes, France
fDate :
6/1/2012
Abstract :
This paper describes an initial study in which the open-source Hadoop parallel and distributed run-time environment is used to speed up the construction phase of a large high-dimensional index. It first discusses the typical practical problems developers may run into when porting their code to Hadoop, then presents early experimental results showing that the performance gains are substantial when indexing large data sets.
Keywords :
"Vectors","Indexing","Merging","Programming","Clustering algorithms","Hardware"
Conference_Title :
Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
Print_ISBN :
978-1-4673-2368-0
Electronic_ISBN :
1949-3991
DOI :
10.1109/CBMI.2012.6269848