• DocumentCode
    3648134
  • Title

    Distributed high-dimensional index creation using Hadoop, HDFS and C++

  • Author

    Gylfi Þór Gudmundsson;Laurent Amsaleg;Björn Þór Jónsson

  • Author_Institution
    INRIA, Rennes, France
  • fYear
    2012
  • fDate
    6/1/2012 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper describes an initial study where the open-source Hadoop parallel and distributed run-time environment is used to speedup the construction phase of a large high-dimensional index. This paper first discusses the typical practical problems developers may run into when porting their code to Hadoop. It then presents early experimental results showing that the performance gains are substantial when indexing large data sets.
  • Keywords
    "Vectors","Indexing","Merging","Programming","Clustering algorithms","Hardware"
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
  • ISSN
    1949-3983
  • Print_ISBN
    978-1-4673-2368-0
  • Electronic_ISBN
    1949-3991
  • Type

    conf

  • DOI
    10.1109/CBMI.2012.6269848
  • Filename
    6269848