Title :
Distributed media indexing based on MPI and MapReduce
Author :
Mohamed, Hisham ; Marchand-Maillet, Stéphane
Author_Institution :
Comput. Vision & Multimedia Lab., Univ. of Geneva, Geneva, Switzerland
Abstract :
Web-scale digital assets comprise millions or billions of documents. At this scale, sequential algorithms cannot cope with the data, and parallel and distributed computing becomes the solution of choice. MapReduce is a programming model proposed by Google for scalable data processing and is mainly suited to data-intensive algorithms. In contrast, the Message Passing Interface (MPI) is suited to high-performance algorithms. This paper proposes an adapted structure of the MapReduce programming model, implemented using MPI, for multimedia indexing. Experimental results on a large number of text (XML) excerpts related to images from the ImageNet corpus indicate that our implementation achieves good speedup compared to both the sequential version and earlier versions of MapReduce using MPI. Extensions to index large-scale multimedia collections are discussed.
Keywords :
Internet; XML; message passing; multimedia systems; ImageNet corpus; MPI; MapReduce programming model; Web-scale digital asset; distributed computing; distributed media indexing; large-scale multimedia; message passing interface; multimedia indexing; parallel computing; scalable data processing; sequential algorithm; Data models; Indexing; Libraries; Multimedia communication; Programming
Conference_Title :
Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
Conference_Location :
Annecy
Print_ISBN :
978-1-4673-2368-0
Electronic_ISBN :
1949-3983
DOI :
10.1109/CBMI.2012.6269841