DocumentCode :
2477116
Title :
Text-to-image retrieval based on incremental association via multimodal hypernetworks
Author :
Jung-Woo Ha ; Beom-Jin Lee ; Byoung-Tak Zhang
Author_Institution :
Sch. of Comp. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
fYear :
2012
fDate :
14-17 Oct. 2012
Firstpage :
3245
Lastpage :
3250
Abstract :
Text-to-image retrieval is the task of retrieving images associated with textual queries. A practical text-to-image retrieval model requires an incremental learning method because the volume of multimodal data grows dramatically. Here we propose an incremental text-to-image retrieval method using a multimodal association model. The association model is based on a hypernetwork (HN) in which a vertex corresponds to a textual word or a visual patch and a hyperedge represents a higher-order multimodal association. The HN is learned incrementally by sequential Bayesian sampling. In multimodal hypernetwork-based text-to-image retrieval, a given text query is crossmodally expanded into a visual query, and images similar to the expanded visual query are then retrieved. We evaluated the proposed method using 3,000 images with textual descriptions from Flickr.com. The experimental results show that the proposed method achieves retrieval performance competitive with a baseline method. Moreover, we demonstrate that our method provides robust text-to-image retrieval results as the data increase.
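The retrieval step described in the abstract (expand a text query into a visual query through hyperedges, then rank images against it) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy hyperedges, patch identifiers, weights, and the overlap-based scoring rule are all assumptions made for illustration, and the sequential Bayesian sampling used to learn the hypernetwork incrementally is omitted.

```python
from collections import Counter

# Hypothetical toy hyperedges: (textual words, visual patch ids, weight).
# In the paper these would be learned from data; here they are hand-written.
HYPEREDGES = [
    ({"boat", "sea"}, {"p_blue", "p_hull"}, 2.0),
    ({"boat", "harbor"}, {"p_hull", "p_mast"}, 1.0),
    ({"dog", "grass"}, {"p_green", "p_fur"}, 1.5),
]

# Hypothetical image collection: each image is a set of visual patch ids.
IMAGES = {
    "img_boat": {"p_blue", "p_hull", "p_mast"},
    "img_dog": {"p_green", "p_fur"},
    "img_sky": {"p_blue"},
}


def expand_query(words, hyperedges):
    """Crossmodally expand a text query into a weighted visual query.

    Every hyperedge whose textual part shares words with the query
    contributes its weight (times the overlap size) to each of its patches.
    """
    query = set(words)
    visual_query = Counter()
    for text_part, visual_part, weight in hyperedges:
        overlap = len(query & text_part)
        if overlap:
            for patch in visual_part:
                visual_query[patch] += weight * overlap
    return visual_query


def retrieve(words, hyperedges, images, top_k=2):
    """Rank images by the summed visual-query weight of the patches they contain."""
    visual_query = expand_query(words, hyperedges)
    scores = {
        name: sum(visual_query[p] for p in patches)
        for name, patches in images.items()
    }
    # Sort by descending score, breaking ties by name for determinism.
    ranked = sorted(scores, key=lambda n: (-scores[n], n))
    return [n for n in ranked if scores[n] > 0][:top_k]


if __name__ == "__main__":
    print(retrieve(["boat"], HYPEREDGES, IMAGES))
```

In this sketch, supporting new data would amount to appending or reweighting hyperedges; how the paper's sampling procedure actually does this is not specified in the abstract.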
Keywords :
Bayes methods; image retrieval; learning (artificial intelligence); network theory (graphs); sampling methods; vertex functions; Flickr.com; higher-order multimodal association; hyperedge; incremental association; incremental learning method; incremental text-to-image retrieval method; multimodal association model; multimodal data; multimodal hypernetwork; sequential Bayesian sampling; textual query; vertex; visual patch; visual query; Boats; Data models; Educational institutions; Feature extraction; Image retrieval; Training; Visualization; hypernetworks; incremental learning; text-to-image retrieval; textual-visual association;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
Type :
conf
DOI :
10.1109/ICSMC.2012.6378291
Filename :
6378291