DocumentCode
3322641
Title
A methodology of separating images from text using an OCR approach
Author
Bourbakis, Nikolaos G.
Author_Institution
Center for Intelligent Syst., Binghamton Univ., NY, USA
fYear
1996
fDate
4-5 Nov 1996
Firstpage
311
Lastpage
317
Abstract
This paper presents a document processing methodology based on an OCR approach. The methodology separates text from images while preserving their relationships, so that the original page can later be reconstructed. Text separation and extraction are based on a hierarchical framing process. The process starts with the framing of a single character after its recognition, continues with the framing of a word, and ends with the framing of all text lines.
Keywords
document image processing; image reconstruction; image segmentation; optical character recognition; OCR; document processing; hierarchical framing process; images; page reconstruction; text extraction; text separation; Character generation; Character recognition; Image edge detection; Image recognition; Image reconstruction; Intelligent systems; Object detection; Optical character recognition software; Shape; Text recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligence and Systems, 1996., IEEE International Joint Symposia on
Conference_Location
Rockville, MD
Print_ISBN
0-8186-7728-7
Type
conf
DOI
10.1109/IJSIS.1996.565084
Filename
565084