DocumentCode :
3648134
Title :
Distributed high-dimensional index creation using Hadoop, HDFS and C++
Author :
Gylfi Þór Gudmundsson;Laurent Amsaleg;Björn Þór Jónsson
Author_Institution :
INRIA, Rennes, France
fYear :
2012
fDate :
6/1/2012 12:00:00 AM
Firstpage :
1
Lastpage :
6
Abstract :
This paper describes an initial study where the open-source Hadoop parallel and distributed run-time environment is used to speedup the construction phase of a large high-dimensional index. This paper first discusses the typical practical problems developers may run into when porting their code to Hadoop. It then presents early experimental results showing that the performance gains are substantial when indexing large data sets.
Keywords :
"Vectors","Indexing","Merging","Programming","Clustering algorithms","Hardware"
Publisher :
ieee
Conference_Titel :
Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
ISSN :
1949-3983
Print_ISBN :
978-1-4673-2368-0
Electronic_ISBN :
1949-3991
Type :
conf
DOI :
10.1109/CBMI.2012.6269848
Filename :
6269848
Link To Document :
بازگشت