DocumentCode
2148439
Title
New Binarization Approach Based on Text Block Extraction
Author
Ben Messaoud, Ines ; Amiri, Hamid ; El Abed, Haikal ; Margner, Volker
Author_Institution
Lab. des Syst. et Traitement de Signal (LSTS), Ecole Nat. d´´Ing. de Tunis (ENIT), Tunis, Tunisia
fYear
2011
fDate
18-21 Sept. 2011
Firstpage
1205
Lastpage
1209
Abstract
Document analysis and recognition systems include, usually, several levels, annotation, preprocessing, segmentation, feature extraction, classification and post-processing. Each level may be dependent on or independent from the other levels. The presence of noise in images can affect the performance of the entire system. This noise can be introduced by the digitization step or from the document itself. In this paper, we present a new binarization approach based on a combination between a preprocessing step and a localization step. The aim of the present approach is the application of binarization algorithms on selected objects-of-interest. The evaluation of the developed approach is performed using two benchmarking datasets from the last two document binarization contests (DIBCO 2009 and H-DIBCO 2010). It shows very promising results.
Keywords
document image processing; feature extraction; interference suppression; text analysis; benchmarking datasets; digitization; document analysis; document binarization contests; document recognition; image preprocessing; localization; noise; text block extraction; Equations; Image edge detection; Mathematical model; Noise measurement; PSNR; Binarization evaluation; Document analysis; Document image binarization; Preprocessing;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location
Beijing
ISSN
1520-5363
Print_ISBN
978-1-4577-1350-7
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2011.243
Filename
6065501
Link To Document