Title :
Connected and degraded text recognition using hidden Markov model
Author :
Bose, Chinmoy B. ; Kuo, Shyh-shiaw
Author_Institution :
Signal Processing Res. Dept., AT&T Bell Labs., Murray Hill, NJ, USA
fDate :
30 Aug-3 Sep 1992
Abstract :
The authors apply a hidden Markov model (HMM) and a level-building dynamic programming algorithm to the problem of robust machine recognition of connected and degraded characters forming words in a poorly printed text. A structural analysis algorithm is used to segment a word into sub-character segments irrespective of the character boundaries, and to identify the primitive features in each segment such as strokes and arcs. The states of the HMM for each character are statistically represented by the sub-character segments and the state characteristics are obtained by determining the state probability functions based on the training samples. A level-building dynamic programming algorithm combines word-segmentation and recognition in one operation and chooses the best probable grouping of characters for recognition of an unknown word. The computer experiments demonstrate the robustness and effectiveness of the system for recognizing words formed by degraded and connected characters
Keywords :
Markov processes; character recognition; dynamic programming; image segmentation; arcs; connected text; degraded text recognition; hidden Markov model; level-building dynamic programming algorithm; poorly printed text; primitive feature identification; segmentation; state probability functions; strokes; structural analysis; sub-character segments; word-segmentation; Character recognition; Degradation; Dynamic programming; Feature extraction; Heuristic algorithms; Hidden Markov models; Image segmentation; Robustness; Signal processing algorithms; Text recognition;
Conference_Titel :
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2915-0
DOI :
10.1109/ICPR.1992.201734