Title :
Segmentation of Text and Graphics from Document Images
Author :
Chowdhury, S.P. ; Mandal, S. ; Das, A.K. ; Chanda, Bhabatosh
Author_Institution :
B. E. & Sc. Univ., Durgapur
Abstract :
Text, graphics and half-tones are the major constituents of any document page. While half-tone can be characterised by its inherent intensity variation, text and graphics share common characteristics except difference in spatial distribution. The success of document image analysis systems depends on the proper segmentation. The success of document image analysis systems depends on the proper segmentation of text and graphics as text is further subdivided into other classes such as heading, table and math-zones. Segmentation of graphics is essential for better OCR performance and vectorization in computer vision applications. Graphics segmentation from text is particularly difficult in the context of graphics made of small components (dashed or dotted lines etc.) which have many features similar to texts. Here we propose a robust technique for segmenting all sorts of graphics and texts in any orientation from document pages.
Keywords :
computer graphics; computer vision; document image processing; image segmentation; optical character recognition; text analysis; OCR; computer vision; document image analysis system; graphics segmentation; text segmentation; Computer graphics; Computer vision; Engineering drawings; Filtering; Filters; Image analysis; Image segmentation; Optical character recognition software; Robustness; Text analysis;
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4376989