Regional Information Center for Science and Technology - Text-to-image retrieval based on incremental association via multimodal hypernetworks

DocumentCode :

2477116

Title :

Text-to-image retrieval based on incremental association via multimodal hypernetworks

Author :

Jung-Woo Ha ; Beom-Jin Lee ; Byoung-Tak Zhang

Author_Institution :

Sch. of Comp. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea

fYear :

2012

fDate :

14-17 Oct. 2012

Firstpage :

3245

Lastpage :

3250

Abstract :

Text-to-image retrieval is the task of retrieving images associated with textual queries. For practical use, a text-to-image retrieval model requires an incremental learning method, since multimodal data grow dramatically. Here we propose an incremental text-to-image retrieval method using a multimodal association model. The association model is based on a hypernetwork (HN), where a vertex corresponds to a textual word or a visual patch and a hyperedge represents a higher-order multimodal association. In multimodal hypernetwork-based text-to-image retrieval, the HN is learned incrementally by sequential Bayesian sampling; a given text query is crossmodally expanded into a visual query, and images similar to the expanded visual query are then retrieved. We evaluated the proposed method using 3,000 images with textual descriptions from Flickr.com. The experimental results show that the proposed method achieves very competitive retrieval performance compared to a baseline method. Moreover, we demonstrate that our method provides robust text-to-image retrieval results as the data increase.
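
The pipeline described in the abstract (hyperedges linking words to visual patches, incremental updates, crossmodal query expansion, then image ranking) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class `ToyHypernetwork`, the function `retrieve`, the string patch labels, and the sample data are all hypothetical, and the sequential Bayesian sampling of hyperedges is replaced here by a deterministic enumeration of word and patch combinations with simple counts as weights.

```python
from collections import Counter
from itertools import combinations


class ToyHypernetwork:
    """Weighted hyperedges linking word subsets to visual-patch subsets.

    Illustrative only: weights are plain co-occurrence counts, and hyperedges
    are enumerated exhaustively instead of being drawn by Bayesian sampling.
    """

    def __init__(self, order=1):
        self.order = order        # words and patches per hyperedge (assumed)
        self.weights = Counter()  # (word tuple, patch tuple) -> count

    def update(self, words, patches):
        """Incrementally add hyperedges from one new (text, image) example."""
        for w in combinations(sorted(set(words)), self.order):
            for p in combinations(sorted(set(patches)), self.order):
                self.weights[(w, p)] += 1

    def expand(self, query_words):
        """Expand a text query into weighted visual patches (a visual query)."""
        query = set(query_words)
        visual = Counter()
        for (w, p), weight in self.weights.items():
            if set(w) <= query:  # hyperedge's text part is matched by the query
                for patch in p:
                    visual[patch] += weight
        return visual


def retrieve(hn, query_words, images, top_k=3):
    """Rank images by the expanded-query weight covered by their patches."""
    visual = hn.expand(query_words)
    scores = {name: sum(visual[p] for p in set(patches))
              for name, patches in images.items()}
    # Sort by descending score, breaking ties by image name.
    ranked = sorted(scores, key=lambda n: (-scores[n], n))
    return ranked[:top_k]


# Hypothetical data: each image is a bag of symbolic patch labels.
images = {
    "img_a": ["blue", "wave", "sail"],
    "img_b": ["green", "leaf"],
    "img_c": ["blue", "sky"],
}
hn = ToyHypernetwork(order=1)
hn.update(["boat", "sea"], images["img_a"])  # first example
hn.update(["tree"], images["img_b"])         # examples arriving later
hn.update(["sky", "sea"], images["img_c"])
print(retrieve(hn, ["boat", "sea"], images))
```

Because `update` only adds to the existing counts, new examples can be absorbed without retraining from scratch, which is the property the abstract refers to as incremental learning. A real system would use extracted visual features rather than string labels.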

Keywords :

Bayes methods; image retrieval; learning (artificial intelligence); network theory (graphs); sampling methods; vertex functions; Flickr.com; higher-order multimodal association; hyperedge; incremental association; incremental learning method; incremental text-to-image retrieval method; multimodal association model; multimodal data; multimodal hypernetwork; sequential Bayesian sampling; textual query; vertex; visual patch; visual query; Boats; Data models; Educational institutions; Feature extraction; Image retrieval; Training; Visualization; hypernetworks; incremental learning; text-to-image retrieval; textual-visual association;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on

Conference_Location :

Seoul

Print_ISBN :

978-1-4673-1713-9

Electronic_ISBN :

978-1-4673-1712-2

Type :

conf

DOI :

10.1109/ICSMC.2012.6378291

Filename :

6378291

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2477116