DocumentCode :
1834117
Title :
Augmented edit distance based temporal contiguity analysis for improved videotext recognition
Author :
Aradhye, Hrishikesh ; Dorai, Chitra
Author_Institution :
Ohio State Univ., Columbus, OH, USA
fYear :
2001
fDate :
2001
Firstpage :
275
Lastpage :
280
Abstract :
Videotext refers to text superimposed on video frames; it enables automatic content annotation and indexing of large video and image collections. Its importance is underscored by the fact that a videotext-based multimedia description scheme has recently been adopted into the MPEG-7 standard. A study of published work on automatic videotext extraction and recognition reveals that, despite recent interest, a reliable general-purpose video character recognition (VCR) system is yet to be developed. In developing a VCR system designed specifically to handle the low-resolution output of videotext extractors, we observed that the raw VCR accuracies obtained with various classifiers, including kernel space methods such as SVM, are inadequate for accurate video annotation. To improve VCR performance, we propose an intelligent postprocessing mechanism supported by general data characteristics of this domain. We describe temporal contiguity analysis, which is independent of the raw character recognition technique and works well even for moving videotext. This novel mechanism can be easily implemented in conjunction with VCR algorithms being developed elsewhere to offer the same performance gains. Experimental results on various video streams show notable improvements in recognition rates with our system, which incorporates an SVM-based recognition engine and temporal contiguity analysis.
Keywords :
character recognition; database indexing; feature extraction; learning automata; multimedia databases; very large databases; video databases; viewdata; visual databases; MPEG-7 standard; SVM; augmented edit distance; automatic content annotation; character recognition; image collections; indexing; intelligent postprocessing; large video collections; low resolution output; multimedia description scheme; temporal contiguity analysis; video character recognition; video frames; videotext extractors; videotext recognition; Character recognition; Data mining; Indexing; Kernel; MPEG 7 Standard; Performance gain; Streaming media; Support vector machine classification; Support vector machines; Video recording;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia Signal Processing, 2001 IEEE Fourth Workshop on
Conference_Location :
Cannes
Print_ISBN :
0-7803-7025-2
Type :
conf
DOI :
10.1109/MMSP.2001.962746
Filename :
962746