DocumentCode
3648134
Title
Distributed high-dimensional index creation using Hadoop, HDFS and C++
Author
Gylfi Þór Gudmundsson;Laurent Amsaleg;Björn Þór Jónsson
Author_Institution
INRIA, Rennes, France
fYear
2012
fDate
6/1/2012 12:00:00 AM
Firstpage
1
Lastpage
6
Abstract
This paper describes an initial study in which the open-source Hadoop parallel and distributed run-time environment is used to speed up the construction phase of a large high-dimensional index. It first discusses the typical practical problems developers may run into when porting their code to Hadoop, then presents early experimental results showing that the performance gains are substantial when indexing large data sets.
Keywords
"Vectors","Indexing","Merging","Programming","Clustering algorithms","Hardware"
Publisher
IEEE
Conference_Titel
Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
ISSN
1949-3983
Print_ISBN
978-1-4673-2368-0
Electronic_ISBN
1949-3991
Type
conf
DOI
10.1109/CBMI.2012.6269848
Filename
6269848