DocumentCode :
268376
Title :
On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval
Author :
Costa Pereira, José ; Coviello, Emanuele ; Doyle, Gabriel ; Rasiwasia, Nikhil ; Lanckriet, Gert R. G. ; Levy, R. ; Vasconcelos, Nuno
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, San Diego, La Jolla, CA, USA
Volume :
36
Issue :
3
fYear :
2014
fDate :
March 2014
Firstpage :
521
Lastpage :
535
Abstract :
The problem of cross-modal retrieval from multimedia repositories is considered. This problem addresses the design of retrieval systems that support queries across content modalities, for example, using an image to search for texts. A mathematical formulation is proposed, equating the design of cross-modal retrieval systems to that of isomorphic feature spaces for different content modalities. Two hypotheses are then investigated regarding the fundamental attributes of these spaces. The first is that low-level cross-modal correlations should be accounted for. The second is that the space should enable semantic abstraction. Three new solutions to the cross-modal retrieval problem are then derived from these hypotheses: correlation matching (CM), an unsupervised method that models cross-modal correlations; semantic matching (SM), a supervised technique that relies on semantic representation; and semantic correlation matching (SCM), which combines both. An extensive evaluation of retrieval performance is conducted to test the validity of the hypotheses. All approaches are shown to be successful for text retrieval in response to image queries and vice versa. It is concluded that both hypotheses hold, in a complementary form, although the evidence in favor of the abstraction hypothesis is stronger than that for correlation.
Keywords :
image matching; image representation; image retrieval; text analysis; unsupervised learning; CM; SCM; SM; content modalities; correlation matching; cross-modal multimedia retrieval; cross-modal retrieval problem; image queries; isomorphic feature spaces; low-level cross-modal correlations; mathematical formulation; multimedia repositories; query support; retrieval performance; retrieval systems design; semantic abstraction; semantic correlation matching; semantic matching; semantic representation; supervised technique; text retrieval; unsupervised method; Correlation; Databases; Hidden Markov models; Joints; Multimedia communication; Semantics; Vectors; Multimedia; content-based retrieval; cross-modal; image and text; kernel correlation; logistic regression; multimodal; retrieval model; semantic spaces;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
IEEE
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2013.142
Filename :
6573933