DocumentCode
2014473
Title
Robust Binarization for Video Text Recognition
Author
Saidane, Zohra ; Garcia, Christophe
Author_Institution
Orange Lab., Cesson-Sevigne
Volume
2
fYear
2007
fDate
23-26 Sept. 2007
Firstpage
874
Lastpage
879
Abstract
This paper presents an automatic binarization method for color text areas in images or videos, which is robust to complex background, low resolution or video coding artefacts. Based on a specific architecture of convolutional neural networks, the proposed system automatically learns how to perform binarization, from a training set of synthesized text images and their corresponding desired binary images, without making any assumptions or using tunable parameters. The proposed method is compared to state-of-the-art binarization techniques, with respect to Gaussian noise and contrast variations, demonstrating the robustness and the efficiency of our method. Text recognition experiments on a database of images extracted from video frames and web pages, with two classical OCRs applied on the obtained binary images show a strong enhancement of the recognition rate by more than 40%.
Keywords
Gaussian noise; character recognition; image colour analysis; neural nets; Gaussian noise; Web pages; automatic binarization method; binary images; convolutional neural networks; robust binarization; state-of-the-art binarization techniques; video coding artefacts; video text recognition; Convolutional codes; Gaussian noise; Image databases; Image resolution; Network synthesis; Neural networks; Noise robustness; Text recognition; Video coding; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location
Parana
ISSN
1520-5363
Print_ISBN
978-0-7695-2822-9
Type
conf
DOI
10.1109/ICDAR.2007.4377040
Filename
4377040
Link To Document