DocumentCode
3058316
Title
A character segmentation method for Japanese printed documents coping with touching character problems
Author
Ariyoshi, Shunji
Author_Institution
Res. & Dev. Center, Toshiba Corp., Kanagawa, Japan
fYear
1992
fDate
30 Aug-3 Sep 1992
Firstpage
313
Lastpage
316
Abstract
Proposes a character segmentation method for Japanese printed documents. Since character segmentation is a kind of search problem, avoiding 'combinatorial explosion' is essential in realizing practical systems. Segmentation is especially complicated when characters touch each other. The method described is a multi-stage algorithm in which the earlier stages handle the more reliable segmentations and the later stages utilize information obtained from the results of the earlier stages. Segmentation hypotheses are generated in each stage on the basis of the results of earlier stages, and they are verified by the character recognition results. Experiments on more than one hundred documents have shown that this method is efficient and accurate for practical applications.
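The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea it describes: an early stage accepts the reliable segments, and a later stage uses what was learned there to generate hypotheses for touching characters, which are then verified by recognition. The two-stage layout, the equal-width splitting rule, the `accept` threshold, and the names `segment_line` and `toy_recognizer` are all assumptions made for illustration, not the paper's method.

```python
from statistics import median
from typing import Callable, List, Tuple

Box = Tuple[int, int]  # (left, right) column range of a blob; right is exclusive


def segment_line(blobs: List[Box],
                 recognize: Callable[[Box], float],
                 accept: float = 0.8) -> List[Box]:
    """Two-stage segmentation sketch (hypothetical, for illustration only)."""
    # Score each blob once so the recognizer is not called twice per blob.
    scores = {b: recognize(b) for b in blobs}

    # Stage 1: keep the blobs the recognizer is already confident about.
    reliable = [b for b in blobs if scores[b] >= accept]
    if not reliable:
        return list(blobs)  # nothing to learn from; leave the line untouched

    # Information carried to the next stage: the typical character width.
    pitch = median(right - left for left, right in reliable)

    result: List[Box] = []
    for left, right in blobs:
        if scores[(left, right)] >= accept:
            result.append((left, right))
            continue
        # Stage 2: hypothesize that the blob holds n touching characters
        # of roughly the pitch width, and cut it into n equal pieces.
        n = max(1, round((right - left) / pitch))
        cuts = [left + round(i * (right - left) / n) for i in range(n + 1)]
        pieces = list(zip(cuts, cuts[1:]))
        # Verify the hypothesis with the recognizer; otherwise keep the blob.
        if n > 1 and all(recognize(p) >= accept for p in pieces):
            result.extend(pieces)
        else:
            result.append((left, right))
    return result


def toy_recognizer(box: Box) -> float:
    """Stand-in for a real classifier: confident only for widths of 18 to 22."""
    left, right = box
    return 0.9 if 18 <= right - left <= 22 else 0.1


segments = segment_line([(0, 20), (25, 45), (50, 90), (95, 115)], toy_recognizer)
print(segments)
```

Because only one hypothesis is generated per unreliable blob, and only after the reliable blobs have fixed the pitch, the number of recognizer calls stays linear in the number of blobs, which is the kind of search pruning the abstract alludes to.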
Keywords
character recognition; document image processing; image segmentation; Japanese printed documents; character segmentation; multistage algorithm; touching character problems; Books; Dynamic programming; Optical character recognition software; Research and development; Search problems; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location
The Hague
Print_ISBN
0-8186-2915-0
Type
conf
DOI
10.1109/ICPR.1992.201780
Filename
201780