DocumentCode :
3177712
Title :
Power-law transformation for enhanced recognition of born-digital word images
Author :
Kumar, Deepak ; Ramakrishnan, A.G.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
fYear :
2012
fDate :
22-25 July 2012
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, we discuss the issues related to word recognition in born-digital word images. We introduce a novel method of power-law transformation on the word image for binarization. We show the improvement in image binarization and the consequent increase in the recognition performance of OCR engine on the word image. The optimal value of gamma for a word image is automatically chosen by our algorithm with fixed stroke width threshold. We have exhaustively experimented our algorithm by varying the gamma and stroke width threshold value. By varying the gamma value, we found that our algorithm performed better than the results reported in the literature. On the ICDAR Robust Reading Systems Challenge-1: Word Recognition Task on born digital dataset, as compared to the recognition rate of 61.5% achieved by TH-OCR after suitable pre-processing by Yang et. al. and 63.4% by ABBYY Fine Reader (used as baseline by the competition organizers without any preprocessing), we achieved 82.9% using Omnipage OCR applied on the images after being processed by our algorithm.
Keywords :
optical character recognition; ICDAR robust reading systems; TH-OCR; born digital dataset; born-digital word image enhanced recognition; fixed stroke width threshold; image binarization; omnipage OCR; optical character recognition; power-law transformation; Cameras; Character recognition; Engines; Image recognition; Optical character recognition software; Text analysis; Transforms; binarization; born-digital image; power-law transform; stroke width; word recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications (SPCOM), 2012 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4673-2013-9
Type :
conf
DOI :
10.1109/SPCOM.2012.6290009
Filename :
6290009
Link To Document :
بازگشت