DocumentCode
2832987
Title
A system for reading USA census ´90 hand-written fields
Author
Simoncini, L. ; Kovacs, Z.M.
Author_Institution
Dipartimento di Elettronica, Inf. e Sistemistica, Bologna Univ., Italy
Volume
1
fYear
1995
fDate
14-16 Aug 1995
Firstpage
86
Abstract
A specifically designed recognition system is presented for reading the Industry or Employer section of the Official 1990 U.S. Census form. This system was a participant in the 2nd Census OCR Systems Conference, organised by the U.S. Bureau of Census and the National Institute of Standards and Technology. It handles the complete reading task starting from the scanned raster image of the form arriving to an ASCII string containing what was written in the fields of the original form. The system is based on several building blocks, connected in a suitable way. In particular, the main operations of the recognition engine are: form identification, field isolation and bounding box removal, field and blob segmentation, broken character joining, isolated character recognition, word building, dictionary correction and finally, hypothesis and confidence generation. Each processing step is described an detail. Results on the NIST Special Database 13 are also reported
Keywords
government data processing; handwriting recognition; optical character recognition; standards; ASCII string; NIST Special Database; USA census ´90 handwritten fields reading; blob segmentation; bounding box removal; broken character joining; complete reading task; confidence generation; dictionary correction; field isolation; form identification; isolated character recognition; scanned raster image; word building; Character generation; Character recognition; Dictionaries; Digital images; Engines; Image databases; NIST; Office automation; Optical character recognition software; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location
Montreal, Que.
Print_ISBN
0-8186-7128-9
Type
conf
DOI
10.1109/ICDAR.1995.598950
Filename
598950
Link To Document