• DocumentCode
    3593741
  • Title
    How can document analysis help in capturing five million pages?
  • Author
    Suda, P.; Maderlechner, G.; Bock, H.; Klunder, H.P.
  • Author_Institution
    Siemens AG, Munich, Germany
  • Volume
    1
  • fYear
    1995
  • Firstpage
    372
  • Abstract
    This paper describes how document analysis techniques such as OCR, layout analysis, and model-based recognition and interpretation can be fruitfully applied to high-volume, high-accuracy document capturing under very tight time constraints. We describe how we set up a workflow that enables reliable capturing of real-estate registration documents. Techniques from document analysis are used to speed up the archiving process and to raise its quality. In particular, we describe an automatic method for determining the positions at which new text can be entered in partially filled text columns. This makes it possible to bridge the gap between the non-coded archived documents and the coded information that is later used to update the documents.
  • Keywords
    document handling; document image processing; optical character recognition; real estate data processing; OCR; document analysis; document capturing; layout analysis; model based interpretation; model based recognition; real-estate registration documents; Bridges; Hardware; Information management; Insurance; Optical character recognition software; Terminology; Text analysis; Text recognition; Time factors; Turning
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
  • Print_ISBN
    0-8186-7128-9
  • Type
    conf
  • DOI
    10.1109/ICDAR.1995.599016
  • Filename
    599016