Title :
Two-stage sparse graph construction using MinHash on MapReduce
Author :
Hsieh, Liang-Chi ; Wu, Guan-Long ; Lee, Wen-Yu ; Hsu, Winston
Author_Institution :
Grad. Inst. of Networking & Multimedia, Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Image graph attracts attention from researchers due to the empirical success of graph based semi-supervised learning (SSL) methods and tasks such as image clustering, image navigation. Despite its simple structure, overwhelming scale of online images makes image graph construction a difficult problem. The challenge lies in time-consuming computation, and the difficulty of storing, processing the resulted graphs of huge size. We propose a novel method of image graph construction on MapReduce for large-scale data. The method consists of two stages: the first stage separates images into overlapping groups called image pools by using hash method, and the second computes pairwise similarities for pairs of images that are grouped into common pools. Both stages are performed on MapReduce. Our experiments on large-scale data show that the proposed method generates more sparse image graphs that reserve same or improved accuracy when comparing with previous method.
Keywords :
graph theory; image processing; learning (artificial intelligence); MapReduce; MinHash; graph based semisupervised learning; image clustering; image graph construction; image navigation; online images; pairwise similarities; simple structure; sparse image graphs; two-stage sparse graph construction; Accuracy; Approximation algorithms; Educational institutions; Multimedia communication; Navigation; Semisupervised learning; Visualization; Image graph; hash; sparse graph;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288057