Title :
Character pattern extraction from colorful documents with complex backgrounds
Author :
Goto, Hideaki ; Aso, Hirotomo
Author_Institution :
Inf. Synergy Center, Tohoku Univ., Sendai, Japan
Abstract :
Today there are lots of documents in which text characters are printed on colored and/or complex backgrounds. We previously proposed a character pattern extraction method by which character patterns can be extracted from grayscale document images with complex background. The method has unique, advantageous properties; it is capable of extracting very small characters and is tolerant of shadings of images. However, the method did not work well for some color documents since it lacks the ability of discriminating color difference. This paper proposes an enhanced version of the method, which utilizes the local color segmentation and the region growing. The experimental results have shown that the new method yields much better results for color magazine covers without sacrificing the performance of extracting small character patterns. The method is tolerant of shadings of images as well.
Keywords :
document image processing; image colour analysis; image segmentation; optical character recognition; character pattern extraction; color difference; colorful documents; complex backgrounds; experimental results; grayscale document images; image shadings; local color segmentation; region growing; Brightness; Cameras; Character recognition; Color; Computer graphics; Data mining; Gray-scale; Image segmentation; Text analysis; Text recognition;
Conference_Titel :
Pattern Recognition, 2002. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-1695-X
DOI :
10.1109/ICPR.2002.1047824