DocumentCode :
3057945
Title :
Text data extraction from microfilm images of punched cards
Author :
Kumar, Sudha U. ; Kasturi, Rangachar
Author_Institution :
Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
fYear :
1992
fDate :
30 Aug-3 Sep 1992
Firstpage :
230
Lastpage :
233
Abstract :
A system for reading text data from microfilm images of punched cards is described. The input is a high resolution gray level image obtained by scanning the card image from the microfilm. Noise due to the poor quality of microfilm data and similarity in gray levels of noise patches and punches are the major problems for text extraction. Thresholding, skew correction and morphological operations are performed on the input gray level image. Card parameters such as positions of punches, etc., are calculated and used along with the knowledge about the contents of the card to separate punched holes from other artifacts. Text data are recognized by locating the punched holes and errors are corrected by a context-based approach. The algorithm has been implemented in software and tested on several images
Keywords :
document image processing; microforms; optical character recognition; OCR; context based text recognition; document processing; gray level image; image scanners; microfilm images; morphological operations; punched cards; skew correction; text extraction; thresholding; Data mining; Data preprocessing; History; Image analysis; Image resolution; Military computing; Noise level; Pixel; Software systems; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2915-0
Type :
conf
DOI :
10.1109/ICPR.1992.201761
Filename :
201761
Link To Document :
بازگشت