Title :
Data-driven image captioning with meta-class based retrieval
Author :
Kilickaya, M. ; Erdem, Esra ; Erdem, A Tanju ; Cinbis, Nazli Ikizler ; Cakici, Ruket
Author_Institution :
Bilgisayar Muhendisligi Bolumu, Hacettepe Univ., Ankara, Turkey
Abstract :
Automatic image captioning, the process of producing a description for an image, is a very challenging problem which has only recently received interest from the computer vision and natural language processing communities. In this study, we present a novel data-driven image captioning strategy which, for a given image, finds the most visually similar image in a large dataset of image-caption pairs and transfers its caption as the description of the input image. Our novelty lies in employing a recently proposed high-level global image representation, named the meta-class descriptor, to better capture the semantic content of the input image for use in the retrieval process. Our experiments show that, as compared to the baseline Im2Text model, our meta-class guided approach produces more accurate descriptions.
Keywords :
computer vision; image representation; image retrieval; natural language processing; automatic data-driven image captioning strategy; computer vision community; high-level global image representation; image-caption pairs; image-to-text; input image semantic content; meta-class based retrieval; meta-class descriptor; natural language processing community; Cameras; Clouds; Conferences; Face; Marine vehicles; Senior citizens; Signal processing; image captioning; image-to-text;
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2014 22nd
Conference_Location :
Trabzon
DOI :
10.1109/SIU.2014.6830631