Title :
Page Curling Correction for Scanned Books Using Local Distortion Information
Author :
Kluzner, Vladimir ; Tzadok, Asaf
Author_Institution :
Document Process. & Manage. Group, IBM Res. - Haifa, Haifa, Israel
Abstract :
The correction of page curling in scanned document images has attracted a lot of attention in recent years. Fixing page curling is essential because of the resulting damage in the visual perception of the scanned text and the ensuing reduction in OCR performance on the distorted image. It has been generally concluded that correcting the distortion due to page curling will serve as a solid basis for increased OCR accuracy. We present a novel approach for the efficient correction of page curling in the images of scanned book pages. The approach is based on the fact that approximately 70% of the words in any book are recurring terms. Thus, for many distorted words, a distinct and clear reference word can be found. Our work computes a global, polynomial transformation-based correction for the page distortion. This correction is based on the estimation of various local distortions in the given page, which are characterized by located words. Experiments on the scanned page images of an 18th century book printed in Old Gothic font have demonstrated the effectiveness of the proposed technique.
Keywords :
document image processing; optical character recognition; polynomials; OCR accuracy; image distortion; local distortion information; page curling correction; polynomial transformation-based correction; scanned book page image; Books; Minimization; Nonlinear distortion; Optical character recognition software; Optical distortion; Polynomials; Text analysis; de-warping; distortion compensation; distortion correction; page curling; polynomial transformation;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.182