DocumentCode :
2504005
Title :
Unsupervised Evaluation Methods Based on Local Gray-Intensity Variances for Binarization of Historical Documents
Author :
Ramírez-Ortegón, Marte A. ; Rojas, Raúl
Author_Institution :
Inst. fur Inf., Freie Univ. Berlin, Berlin, Germany
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
2029
Lastpage :
2032
Abstract :
We attempt to evaluate the efficacy of six unsupervised evaluation method to tune Sauvola´s threshold in optical character recognition (OCR) applications. We propose local implementations of well-known measures based on gray-intensity variances. Additionally, we derive four new measures from them using the unbiased variance estimator and gray-intensity logarithms. In our experiment, we selected the well binarized images, according each measure, and computed the accuracy of the recognized text of each. The results show that the weighted and uniform variance (using logarithms) are suitable measures for OCR applications.
Keywords :
document image processing; optical character recognition; Sauvola threshold; gray-intensity variances; historical documents binarization; local gray intensity variances; optical character recognition; unbiased variance estimator; unsupervised evaluation method; Accuracy; Image segmentation; Libraries; Optical character recognition software; Pixel; Text recognition; Weight measurement; binarization; evaluation; unsupervised;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.500
Filename :
5597252
Link To Document :
بازگشت