DocumentCode :
1873298
Title :
Improved text overlay detection in videos using a fusion-based classifier
Author :
Tseng, Belle L. ; Lin, Ching-Yung ; Zhang, DongQing ; Smith, John R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Volume :
3
fYear :
2003
fDate :
6-9 July 2003
Abstract :
In this paper, classifier fusion is adopted to demonstrate improved performance for our text overlay detections in the NIST TREC-2002 video retrieval benchmark. A normalized ensemble fusion is explored to combine two text overlay detection models. The fusion incorporates normalization of confidence scores, aggregation via combiner function, and an optimize selection. The proposed fusion classifier resulted best out of 11 detectors submitted to the NIST text overlay detection benchmarking and its average precision performance is 227% of the second best detector in the benchmark.
Keywords :
image retrieval; optimisation; sensor fusion; text analysis; video signal processing; combiner function; confidence scores; fusion-based classifier; optimize selection; text detectors; text overlay detection; video data sets; video retrieval benchmark; Content based retrieval; Detectors; Fusion power generation; Image color analysis; Image segmentation; Indexing; Layout; Motion analysis; NIST; Videos;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
Type :
conf
DOI :
10.1109/ICME.2003.1221351
Filename :
1221351
Link To Document :
بازگشت