DocumentCode :
433079
Title :
Over-complete representation and fusion for semantic concept detection
Author :
Natsev, Apostol ; Naphade, Milind R. ; Smith, John R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Volume :
4
fYear :
2004
fDate :
24-27 Oct. 2004
Firstpage :
2375
Abstract :
Automatic semantic concept detection in images is a promising tool for alleviating the user effort in annotating and cataloging digital media collections. It enables automatic identification of people, places and objects, for enhanced indexing and searching of home photographs, for example. While constructing robust semantic detectors has been shown feasible for global generic concepts with a sufficient number of good training examples (e.g., indoors, outdoors), many interesting concepts, such as face, people, occur at subpicture granularity, occupy only a portion of the image and therefore frequently have training examples with a reduced signal-to-noise ratio. Such regional concepts are harder to detect due to imperfections in automatic image segmentation algorithms leading to inaccurate object boundaries and low-level feature ambiguities. In this paper we focus on the problem of boosting detection performance of existing regional concept detectors by exploiting detection redundancy. Specifically, we propose to use the same detector multiple times to evaluate and combine multiple detection hypotheses for the same content-but at different content granularities-in order to reduce detection sensitivity to segmentation errors. We validate the approach using support vector machine classifiers for 14 regional semantic concepts from the NISTTRFCVID 2003 common annotation lexicon and show performance improvements of multigranular detection and fusion.
Keywords :
feature extraction; image classification; image enhancement; image representation; image segmentation; indexing; support vector machines; automatic image segmentation algorithm; boosting detection performance; cataloging digital media collection; enhanced indexing-searching; generic concept; home photographs; lexicon annotation; low-level feature ambiguities; multigranular detection-fusion; over-complete representation; semantic concept detection; signal-to-noise ratio; subpicture granularity; support vector machine classifier; Boosting; Detectors; Face detection; Image segmentation; Indexing; Object detection; Redundancy; Robustness; Signal to noise ratio; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2004. ICIP '04. 2004 International Conference on
ISSN :
1522-4880
Print_ISBN :
0-7803-8554-3
Type :
conf
DOI :
10.1109/ICIP.2004.1421578
Filename :
1421578
Link To Document :
بازگشت