Segmentation and classification for mixed text/image documents using neural network

Author

Imade, Shinichi ; Tatsuta, Seiji ; Wada, Toshiaki

fYear

1993

fDate

20-22 Oct 1993

Firstpage

930

Lastpage

934

Abstract

A segmentation and classification method for separating a document image into printed character, handwritten character, photograph, and painted image regions is presented. A document image is segmented into rectangular areas. Each of which contains a cluster of image elements. A layered feed-forward neural network is then used to classify each segmented area using the histograms of gradient vector directions and luminance levels. A high classification performance was obtained, even with a small number of training samples. It is confirmed that the histograms of gradient vector directions and luminance levels are significantly effective features for the classification of the four kinds of image regions. Increasing the number of the discrimination areas improves the classification performance sufficiently even using a small number of training samples for the neural network

Keywords

Data compression; High speed optical techniques; Histograms; Image coding; Image converters; Image segmentation; Image storage; Neural networks; Optical imaging; Pixel;

fLanguage

English

Publisher

ieee

Conference_Titel

Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on

Conference_Location

Tsukuba Science City

Print_ISBN

0-8186-4960-7

Type

conf

DOI

10.1109/ICDAR.1993.395584

Filename

395584