DocumentCode :
971163
Title :
The document spectrum for page layout analysis
Author :
O´Gorman, Lawrence
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
15
Issue :
11
fYear :
1993
fDate :
11/1/1993 12:00:00 AM
Firstpage :
1162
Lastpage :
1173
Abstract :
Page layout analysis is a document processing technique used to determine the format of a page. This paper describes the document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components. The method yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks. It is advantageous over many other methods in three main ways: independence from skew angle, independence from different text spacings, and the ability to process local regions of different text orientations within the same image. Results of the method shown for several different page formats and for randomly oriented subpages on the same image illustrate the versatility of the method. We also discuss the differences, advantages, and disadvantages of the docstrum with respect to other lay-out methods
Keywords :
document image processing; image segmentation; between-line spacings; bottom-up method; docstrum; document image processing; document spectrum; nearest-neighbor clustering; skew; structural page layout analysis; text spacings; within-line spacings; Character recognition; Document image processing; Image analysis; Image segmentation; Independent component analysis; Magnetic analysis; Optical character recognition software; Optical sensors; Performance analysis; Text analysis;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.244677
Filename :
244677
Link To Document :
بازگشت