DocumentCode :
2630501
Title :
From pixels to paragraphs: The use of contextual models in text recognition
Author :
Srihari, Sargur N.
Author_Institution :
Center of Excellence for Document Analysis & Recognition, State Univ. of New York, Buffalo, NY, USA
fYear :
1993
fDate :
20-22 Oct 1993
Firstpage :
416
Lastpage :
423
Abstract :
Several hierarchical levels in modeling text are discussed. At each level, the model is typically based on elements at the next lower level, e.g., characters in terms of features, words in terms of characters, phrases/sentences in terms of words, and paragraphs in terms of sentences. Such models are referred to as analytic, since they are based on step-by-step analysis. Although such models are usually sufficient, in order to achieve system robustness it is necessary to use models that skip a level, e.g., words in terms of features/pixels and sentences in terms of characters. Such models are holistic in that decisions about entities are made in the context of decisions about other entities. Holistic methods are slower than analytic methods and do not necessarily perform better. However, when analytic and holistic methods are combined, overall system performance is higher than either alone. Recognition algorithms based on different models can be combined to achieve robustness
Keywords :
character recognition; document image processing; contextual models; hierarchical levels; step-by-step analysis; system robustness; text recognition; Character recognition; Context modeling; Gray-scale; Image analysis; Image converters; Image recognition; Image storage; Optical character recognition software; Text analysis; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
Type :
conf
DOI :
10.1109/ICDAR.1993.395704
Filename :
395704
Link To Document :
بازگشت