Title :
Cut classification for segmentation
Author :
Bayer, Thomas A. ; Kressel, U.-G.
Author_Institution :
Inf. Technol., Daimler-Benz AG, Ulm, Germany
Abstract :
In optical character recognition (OCR) and document analysis, many reading errors are not caused by inadequate classifier power, but by segmentation errors. In particular, merged characters are a major remaining problem. An efficient and powerful method of determining cut hypotheses for the segmentation of merged characters is presented. The method is based on a classifier deciding for each column of the character image, whether it represents a cut hypothesis or not. Since in the training phase the classifier is adapted by a sample set consisting of images of merged character patterns, the decision rules are created automatically rather than being man-made heuristics. The results obtained from a large test set show that a high recognition rate can be achieved with a reasonable computational effort
Keywords :
classification; feature extraction; image segmentation; optical character recognition; OCR; character image; classifier; cut classification; cut hypotheses; decision rules; document analysis; large test set; merged characters; optical character recognition; sample set; segmentation errors; training phase; Automatic testing; Character recognition; Error analysis; Image segmentation; Information analysis; Information processing; Optical character recognition software; Particle beam optics; System testing; Text analysis;
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
DOI :
10.1109/ICDAR.1993.395672