• DocumentCode
    3057945
  • Title
    Text data extraction from microfilm images of punched cards
  • Author
    Kumar, Sudha U. ; Kasturi, Rangachar
  • Author_Institution
    Dept. of Electr. & Comput. Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    1992
  • fDate
    30 Aug-3 Sep 1992
  • Firstpage
    230
  • Lastpage
    233
  • Abstract
    A system for reading text data from microfilm images of punched cards is described. The input is a high-resolution gray-level image obtained by scanning the card image from the microfilm. Noise caused by the poor quality of the microfilm and the similarity in gray levels between noise patches and punches are the major problems for text extraction. Thresholding, skew correction, and morphological operations are performed on the input gray-level image. Card parameters, such as the positions of punches, are calculated and used together with knowledge of the card's contents to separate punched holes from other artifacts. Text data are recognized by locating the punched holes, and errors are corrected by a context-based approach. The algorithm has been implemented in software and tested on several images.
  • Keywords
    document image processing; microforms; optical character recognition; OCR; context based text recognition; document processing; gray level image; image scanners; microfilm images; morphological operations; punched cards; skew correction; text extraction; thresholding; Data mining; Data preprocessing; History; Image analysis; Image resolution; Military computing; Noise level; Pixel; Software systems; Text recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
  • Conference_Location
    The Hague
  • Print_ISBN
    0-8186-2915-0
  • Type
    conf
  • DOI
    10.1109/ICPR.1992.201761
  • Filename
    201761
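
The abstract's final recognition step, reading text from located punched holes, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the earlier stages (thresholding, skew correction, morphology, hole location) have already produced a 12-row binary grid, and it assumes the common Hollerith letter and digit codes for an 80-column card, which the record does not confirm. The names `ROWS`, `build_hollerith_table`, and `decode_card` are hypothetical.

```python
# Illustrative sketch only: decode a binary grid of detected holes into text.
# Assumption: rows are ordered top to bottom as 12, 11, 0, 1..9, and the card
# uses the common Hollerith codes for letters and digits. Special characters
# are omitted because their codes vary between keypunch models.
ROWS = ["12", "11", "0", "1", "2", "3", "4", "5", "6", "7", "8", "9"]


def build_hollerith_table():
    """Map each set of punched row labels to the character it encodes."""
    table = {frozenset(): " "}  # an unpunched column is a blank
    for d in range(10):
        table[frozenset([str(d)])] = str(d)  # digits: a single punch
    for i, ch in enumerate("ABCDEFGHI", start=1):
        table[frozenset(["12", str(i)])] = ch  # zone 12 + digit 1..9
    for i, ch in enumerate("JKLMNOPQR", start=1):
        table[frozenset(["11", str(i)])] = ch  # zone 11 + digit 1..9
    for i, ch in enumerate("STUVWXYZ", start=2):
        table[frozenset(["0", str(i)])] = ch  # zone 0 + digit 2..9
    return table


HOLLERITH = build_hollerith_table()


def decode_card(grid, unknown="?"):
    """Decode a 12 x N grid of 0/1 values (1 = hole detected) column by column.

    Columns whose punch pattern is not in the table are emitted as `unknown`,
    so that a later context-based step could try to correct them.
    """
    n_cols = len(grid[0])
    chars = []
    for c in range(n_cols):
        punched = frozenset(ROWS[r] for r in range(len(ROWS)) if grid[r][c])
        chars.append(HOLLERITH.get(punched, unknown))
    return "".join(chars)
```

A column with punches in rows 12 and 8 would decode as `H` under these assumptions. A column whose pattern matches no table entry, for example because a noise patch was mistaken for a hole, is flagged with the placeholder instead of being guessed.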