DocumentCode
3057945
Title
Text data extraction from microfilm images of punched cards
Author
Kumar, Sudha U. ; Kasturi, Rangachar
Author_Institution
Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
fYear
1992
fDate
30 Aug-3 Sep 1992
Firstpage
230
Lastpage
233
Abstract
A system for reading text data from microfilm images of punched cards is described. The input is a high resolution gray level image obtained by scanning the card image from the microfilm. Noise due to the poor quality of microfilm data and similarity in gray levels of noise patches and punches are the major problems for text extraction. Thresholding, skew correction and morphological operations are performed on the input gray level image. Card parameters such as positions of punches, etc., are calculated and used along with the knowledge about the contents of the card to separate punched holes from other artifacts. Text data are recognized by locating the punched holes and errors are corrected by a context-based approach. The algorithm has been implemented in software and tested on several images
Keywords
document image processing; microforms; optical character recognition; OCR; context based text recognition; document processing; gray level image; image scanners; microfilm images; morphological operations; punched cards; skew correction; text extraction; thresholding; Data mining; Data preprocessing; History; Image analysis; Image resolution; Military computing; Noise level; Pixel; Software systems; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location
The Hague
Print_ISBN
0-8186-2915-0
Type
conf
DOI
10.1109/ICPR.1992.201761
Filename
201761
Link To Document