Title :
Segmentation and classification of document images
Author :
Antonacopoulos, A. ; Ritchings, R.T.
Author_Institution :
Dept. of Comput. Sci., Liverpool Univ., UK
fDate :
11/2/1995 12:00:00 AM
Abstract :
There is a significant and growing need to convert documents from printed paper to an electronic form. Document image analysis is concerned with the segmentation of the document image into regions of interest, their description, and the classification of the regions according to the type of their contents. A new unified approach to page segmentation and classification, based on the description of the background with tiles, is presented. The segmentation method is flexible to successfully analyse and describe regions in complicated layouts where other methods fail. Images with severe skew are handled equally well with no additional computations. The classification is based on textural features which are derived by simple calculations from the representation of space in the regions, produced during the segmentation process. This is a considerable advantage over previous methods where extra image accesses and lengthy computations are necessary. Overall, the whole approach of segmentation and classification by white tiles is fast and efficient as no time-consuming processes are required
Keywords :
document image processing; image classification; image representation; image segmentation; image texture; background description; complicated layouts; contents; document conversion; document image analysis; document image classification; document image segmentation; electronic documents; image region classification; page classification; page segmentation; printed documents; severe skew images; space representation; textural features; tiles; white tiles;
Conference_Titel :
Document Image Processing and Multimedia Environments, IEE Colloquium on
Conference_Location :
London
DOI :
10.1049/ic:19951197