Title :
High-speed, high-accuracy binarization method for recognizing text in images of low spatial resolutions
Author :
Kamada, Hiroshi ; Fujimoto, Katsuhito
Author_Institution :
Fujitsu Labs. Ltd., Kawasaki, Japan
Abstract :
We propose a new high-speed, high-accuracy binarization method for recognizing text in document images. First character neighborhoods are extracted from input images using a global thresholding value that is shifted to the background pixel value from the thresholding value of conventional global binarization. Second, characters are extracted using an original local binarization process integrated with image interpolation. Our method takes only 1/100 the processing time over the method that performs image interpolation first. Therefore our method binarizes an A4 size text image (150dpi) in an average of only 3.3 seconds using a 166 MHz Pentium processor. Furthermore, our method reduced unrecognized characters by 46.5%, compared with conventional global binarization
Keywords :
document image processing; image resolution; interpolation; optical character recognition; Pentium processor; character neighborhoods; document image processing; global thresholding value; high-accuracy binarization method; image interpolation; image resolution; low spatial resolutions; text recognition; Character recognition; Color; Gray-scale; Image recognition; Image resolution; Interpolation; Pixel; Skeleton; Spatial resolution; Text recognition;
Conference_Titel :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
DOI :
10.1109/ICDAR.1999.791744