DocumentCode :
3485728
Title :
Color Drop-Out Binarization Method for Document Images with Color Shift
Author :
Seki, Morihiro ; Asano, Eisuke ; Yasue, Tsuneo ; Nagayoshi, Hiroto ; Shinjo, Hiroshi ; Nagasaki, Takeshi
Author_Institution :
Central Res. Lab., Hitachi Ltd., Kokubunji, Japan
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
123
Lastpage :
127
Abstract :
A novel method using "color drop-out" for document images with "color shift" is proposed. Color shift phenomena sometimes occur in document images captured by a camera device or stand type scanner. It adversely affects the binarization and character recognition processes, because it generates pseudo color pixels on scanned image, which do not exist on the original document. To solve the "pseudo color problem," a binarization method based on the following three calculation steps is proposed. First, line and character areas are estimated coarsely by using form structure analysis and subtracting background from images, second, the color shift is removed by using morphological processing, third, each pixel of the background subtracted images is discriminated into character strings and lines precisely by dynamic color classification. Several character recognition experiments using low quality form samples (in which handwritten strings overlap form lines and preprints) were performed. According to the experimental results, the proposed method attains character recognition accuracy of 94.3%, which is 5.2pt. higher than that attained by a conventional method.
Keywords :
cameras; document image processing; image classification; image colour analysis; mathematical morphology; background subtracted image pixel; camera device; character areas; character lines; character recognition accuracy; character recognition processes; character strings; color drop-out binarization method; color shift phenomena; document images; dynamic color classification; form structure analysis; handwritten string; line areas; morphological processing; preprints; pseudocolor pixel generation; pseudocolor problem; scanned image; stand-type scanner; Accuracy; Character recognition; Color; Data mining; Image color analysis; Image recognition; Morphology; binarization; color shift; form; morphology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.32
Filename :
6628597
Link To Document :
بازگشت