• DocumentCode
    2988190
  • Title

    Distributed media indexing based on MPI and MapReduce

  • Author

    Mohamed, Hisham ; Marchand-Maillet, Stéphane

  • Author_Institution
    Comput. Vision & Multimedia Lab., Univ. of Geneva, Geneva, Switzerland
  • fYear
    2012
  • fDate
    27-29 June 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Web-scale digital assets comprise millions or billions of documents. Due to such increase, sequential algorithms cannot cope with this data, and parallel and distributed computing become the solution of choice. MapReduce is a programming model proposed by Google for scalable data processing. MapReduce is mainly applicable for data intensive algorithms. In contrast, The message passing interface (MPI) is suitable for high performance algorithms. This paper proposes an adapted structure of MapReduce programming model using MPI for multimedia indexing. Experimental results on a large number of text (XML) excerpts related to images from the ImageNet corpus indicate that our implementation achieved good speedup compared to the sequential version and the earlier versions of MapReduce using MPI. Extensions to index large-scale multimedia collections are discussed.
  • Keywords
    Internet; XML; message passing; multimedia systems; ImageNet corpus; MPI; MapReduce programming model; Web-scale digital asset; XML; distributed computing; distributed media indexing; large-scale multimedia; message passing interface; multimedia indexing; parallel computing; scalable data processing; sequential algorithm; Data models; Indexing; Libraries; Message passing; Multimedia communication; Programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
  • Conference_Location
    Annecy
  • ISSN
    1949-3983
  • Print_ISBN
    978-1-4673-2368-0
  • Electronic_ISBN
    1949-3983
  • Type

    conf

  • DOI
    10.1109/CBMI.2012.6269841
  • Filename
    6269841