DocumentCode :
1992840
Title :
An algorithm for finding maximal whitespace rectangles at arbitrary orientations for document layout analysis
Author :
Breuel, Thomas M.
Author_Institution :
PARC, Inc., Palo Alto, CA, USA
fYear :
2003
fDate :
3-6 Aug. 2003
Firstpage :
66
Abstract :
The analysis of the background structure (whitespace) of page images has become an important technique for physical document layout analysis. Globally maximal whites-pace rectangles have been previously demonstrated to constitute a concise representation of the major layout features of documents. However, previous methods for computing maximal whitespace rectangles were limited to axis-aligned rectangles. This paper presents an algorithm that finds globally maximal whitespace rectangles on page images at arbitrary orientations. The new algorithm eliminates the need for page rotation correction prior to background analysis and can be applied to considerably more complex page layouts than previously possible. The algorithm is resolution independent and takes as input a list of foreground shapes (e.g., character or word bounding boxes or polygons) and a set of parameter ranges; it outputs the N largest non-overlapping maximal whitespace rectangles whose parameters (location, width, height, orientation) fall within the required parameter ranges. Examples of applications of the method to severely skewed documents, as well as the UW3 database, are presented.
Keywords :
document image processing; image recognition; UW3 database; arbitrary orientations; axis-aligned rectangles; background structure; document layout analysis; largest nonoverlapping maximal whitespace rectangles; layout features; page background analysis; page images; page layouts; page rotation correction; severely skewed documents; Algorithm design and analysis; Computer vision; Databases; Image analysis; Particle separators; Shape; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
Type :
conf
DOI :
10.1109/ICDAR.2003.1227629
Filename :
1227629
Link To Document :
بازگشت