• DocumentCode
    3322641
  • Title

    A methodology of separating images from text using an OCR approach

  • Author

    Bourbakis, Nikoluos G.

  • Author_Institution
    Center for Intelligent Syst., Binghamton Univ., NY, USA
  • fYear
    1996
  • fDate
    4-5 Nov 1996
  • Firstpage
    311
  • Lastpage
    317
  • Abstract
    This paper presents a document processing methodology based on an OCR approach. The document methodology separates text from images by keeping their relationships for a possible reconstruction of the original page. The text separation and extraction is based on a hierarchical framing process. The process starts with the framing a single character, after its recognition, continues with the framing of a word, and ends with the framing of all text lines
  • Keywords
    document image processing; image reconstruction; image segmentation; optical character recognition; OCR; document processing; hierarchical framing process; images; page reconstruction; text extraction; text separation; Character generation; Character recognition; Image edge detection; Image recognition; Image reconstruction; Intelligent systems; Object detection; Optical character recognition software; Shape; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligence and Systems, 1996., IEEE International Joint Symposia on
  • Conference_Location
    Rockville, MD
  • Print_ISBN
    0-8186-7728-7
  • Type

    conf

  • DOI
    10.1109/IJSIS.1996.565084
  • Filename
    565084