Title :
A methodology of separating images from text using an OCR approach
Author :
Bourbakis, Nikolaos G.
Author_Institution :
Center for Intelligent Syst., Binghamton Univ., NY, USA
Abstract :
This paper presents a document processing methodology based on an OCR approach. The methodology separates text from images while preserving their relationships, so that the original page can be reconstructed. Text separation and extraction are based on a hierarchical framing process. The process starts with the framing of a single character and, after its recognition, continues with the framing of a word and ends with the framing of all text lines.
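The hierarchical framing idea (character, then word, then text line) can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes character frames are already available as bounding boxes, and the grouping rules (vertical overlap for lines, a horizontal gap threshold `max_gap` for words) and all function names are hypothetical choices made for illustration.

```python
from typing import List, Tuple

# Frame as (x0, y0, x1, y1); y grows downward. Assumed representation.
Box = Tuple[int, int, int, int]


def union(boxes: List[Box]) -> Box:
    """Smallest frame enclosing all given frames."""
    return (
        min(b[0] for b in boxes),
        min(b[1] for b in boxes),
        max(b[2] for b in boxes),
        max(b[3] for b in boxes),
    )


def group_lines(chars: List[Box]) -> List[List[Box]]:
    """Group character frames into text lines by vertical overlap (assumed rule)."""
    lines: List[List[Box]] = []
    for box in sorted(chars, key=lambda b: (b[1], b[0])):
        for line in lines:
            _, top, _, bottom = union(line)
            if box[1] < bottom and box[3] > top:  # vertical ranges overlap
                line.append(box)
                break
        else:
            lines.append([box])
    return lines


def group_words(line: List[Box], max_gap: int) -> List[List[Box]]:
    """Split one line's character frames into words at gaps wider than max_gap."""
    words: List[List[Box]] = []
    for box in sorted(line, key=lambda b: b[0]):
        if words and box[0] - words[-1][-1][2] <= max_gap:
            words[-1].append(box)
        else:
            words.append([box])
    return words


def frame_hierarchy(chars: List[Box], max_gap: int = 3):
    """Return [(line_frame, [word_frame, ...]), ...] from top to bottom."""
    result = []
    for line in group_lines(chars):
        word_frames = [union(w) for w in group_words(line, max_gap)]
        result.append((union(word_frames), word_frames))
    return result


if __name__ == "__main__":
    chars = [(0, 0, 5, 10), (6, 0, 11, 10), (20, 0, 25, 10), (0, 20, 5, 30)]
    for line_frame, word_frames in frame_hierarchy(chars):
        print(line_frame, word_frames)
```

Whatever falls outside every text-line frame would be a candidate image region, and keeping the frame coordinates is what would allow the page layout to be rebuilt later; both steps are outside this sketch.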
Keywords :
document image processing; image reconstruction; image segmentation; optical character recognition; OCR; document processing; hierarchical framing process; images; page reconstruction; text extraction; text separation; Character generation; Character recognition; Image edge detection; Image recognition; Image reconstruction; Intelligent systems; Object detection; Optical character recognition software; Shape; Text recognition;
Conference_Title :
Intelligence and Systems, 1996., IEEE International Joint Symposia on
Conference_Location :
Rockville, MD
Print_ISBN :
0-8186-7728-7
DOI :
10.1109/IJSIS.1996.565084