DocumentCode :
3174254
Title :
A family of European page readers
Author :
Baird, Henry S. ; Ilbert, Derrickg ; Ittner, Davidj
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
2
fYear :
1994
fDate :
9-13 Oct 1994
Firstpage :
540
Abstract :
We have demonstrated a high degree of automation in the engineering of complex machine vision systems, by building ten printed-text page readers, each specialized to a European language, at the pace of one language per week. The page readers provide these functions: page layout analysis, polyfont symbol recognition, typographical morphology, lexicon-driven contextual analysis, and Unicode output encoding. The accuracy and speed of the resulting readers are usably high, and can be easily improved if required by comparatively routine enhancements of subsystems. This exercise illustrates the advantages of a research strategy that emphasizes versatility before, but not at the expense of, accuracy and speed
Keywords :
document image processing; European language; European page readers; Unicode output encoding; complex machine vision systems; lexicon-driven contextual analysis; page layout analysis; polyfont symbol recognition; printed-text page readers; typographical morphology; Automation; Computer architecture; Design engineering; Encoding; Machine vision; Morphology; Natural languages; Prototypes; Runtime; System software;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 1994. Vol. 2 - Conference B: Computer Vision & Image Processing., Proceedings of the 12th IAPR International. Conference on
Conference_Location :
Jerusalem
Print_ISBN :
0-8186-6270-0
Type :
conf
DOI :
10.1109/ICPR.1994.577014
Filename :
577014
Link To Document :
بازگشت