Title :
Two-stage box connectivity algorithm for optical character recognition
Author :
Krtolica, Radovan ; Malitsky, Sophia
Author_Institution :
Canon Res. Center America, Inc., Palo Alto, CA, USA
Abstract :
The box connectivity approach (BCA) uses a regular grid to partition the character bitmap into n × n boxes. The bitmap is then represented by a graph whose vertices and edges correspond to the boxes and their connectivity. The adjacency matrices of the graphs are represented by two binary matrices of lower size: one for the vertical and one for the horizontal connections. A third binary matrix is used to represent pixel densities in each of the boxes. Hamming distances are used for multicriterion classification of the corresponding binary vectors: only noninferior (Pareto optimal) vectors are retained. Size of the character image and first order Markov chain model of the adjacent characters are used to disambiguate noninferior characters in the second stage of the algorithm
Keywords :
Markov processes; graph theory; matrix algebra; optical character recognition; probability; 2-stage box connectivity algorithm; BCA; Hamming distances; Pareto optimal; adjacency matrices; binary matrices; binary vectors; box connectivity approach; character bitmap; character image; first order Markov chain model; graph; multicriterion classification; noninferior characters; optical character recognition; pixel densities; regular grid; Character recognition; Computational efficiency; Degradation; Multi-stage noise shaping; Nearest neighbor searches; Noise robustness; Optical character recognition software; Optical noise; Partitioning algorithms; Shape;
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
DOI :
10.1109/ICDAR.1993.395754