DocumentCode :
3518580
Title :
Impact of novel sources on content-based image and video retrieval
Author :
Ghoshal, Arnab ; Khudanpur, Sanjeev ; Klakow, Dietrich
Author_Institution :
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
1937
Lastpage :
1940
Abstract :
The problem of content-based image and video retrieval with textual queries is often posed as that of visual concept classification, where classifiers for a set of predetermined visual concepts are trained using a set of manually annotated images. Such a formulation implicitly assumes that the training data has similar distributional characteristics as that of the data which need to be indexed. In this paper we demonstrate empirically that even within the relatively narrow domain of news videos collected from a variety of news programs and broadcasters, the assumption of distributional similarity of visual features does not hold across programs from different broadcasters. This is manifested in considerable degradation of ranked retrieval performance on novel sources. We observe that concepts whose spatial locations remain relatively fixed between various sources are also more robust to source mismatches, and vice versa. We also show that a simple averaging of multiple visual detectors is more robust than any of the individual detectors. Furthermore, we show that for certain sources using only 20% of the available annotated data can bridge roughly 80% of the performance drop, while others can require larger amounts of annotated data.
Keywords :
content-based retrieval; image classification; video retrieval; content-based image retrieval; content-based video retrieval; multiple visual detector; predetermined visual concepts; ranked retrieval performance; textual queries; visual concept classification; Broadcasting; Content based retrieval; Degradation; Detectors; Hidden Markov models; Image retrieval; Information retrieval; Natural languages; Robustness; Speech processing; Information retrieval; Multimedia systems; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4959989
Filename :
4959989
Link To Document :
بازگشت