DocumentCode :
382286
Title :
Robust video text segmentation and recognition with multiple hypotheses
Author :
Dobez, J.-M. ; Chen, Datong
Author_Institution :
IDIAP, Switzerland
Volume :
2
fYear :
2002
fDate :
2002
Abstract :
A method for segmenting and recognizing text embedded in video and images is proposed. Multiple segmentation of the same text region is performed, thus producing multiple hypotheses of binary text images. The segmentation algorithm is stated as a statistical labeling and is based on a Markov random field (MRF) model of the label map. Background regions in each hypothesis are then removed by performing a connected component analysis and by enforcing a more stringent constraint (called GCC - grayscale consistency constraint) on the text characters´ grayscale values using a robust 1D-median operator. Each text image hypothesis is then processed by optical character recognition (OCR) software. The final result is then selected from the set of output strings. Results show that both the use of multiple hypotheses and the GCC significantly improve the results.
Keywords :
Markov processes; feature extraction; image segmentation; optical character recognition; text analysis; video signal processing; 1D-median operator; Markov random field; binary text images; connected component analysis; grayscale consistency constraint; image segmentation; multiple hypotheses; optical character recognition; statistical labeling; video segmentation; video text segmentation; Character recognition; Gray-scale; Image recognition; Image segmentation; Labeling; Markov random fields; Optical character recognition software; Performance analysis; Robustness; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing. 2002. Proceedings. 2002 International Conference on
ISSN :
1522-4880
Print_ISBN :
0-7803-7622-6
Type :
conf
DOI :
10.1109/ICIP.2002.1039980
Filename :
1039980
Link To Document :
بازگشت