DocumentCode :
2917074
Title :
Rectification and 3D reconstruction of curved document images
Author :
Tian, Yuandong ; Narasimhan, Srinivasa G.
Author_Institution :
Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2011
fDate :
20-25 June 2011
Firstpage :
377
Lastpage :
384
Abstract :
Distortions in images of documents, such as the pages of books, adversely affect the performance of optical character recognition (OCR) systems. Removing such distortions requires the 3D deformation of the document that is often measured using special and precisely calibrated hardware (stereo, laser range scanning or structured light). In this paper, we introduce a new approach that automatically reconstructs the 3D shape and rectifies a deformed text document from a single image. We first estimate the 2D distortion grid in an image by exploiting the line structure and stroke statistics in text documents. This approach does not rely on more noise-sensitive operations such as image binarization and character segmentation. The regularity in the text pattern is used to constrain the 2D distortion grid to be a perspective projection of a 3D parallelogram mesh. Based on this constraint, we present a new shape-from-texture method that computes the 3D deformation up to a scale factor using SVD. Unlike previous work, this formulation imposes no restrictions on the shape (e.g., a developable surface). The estimated shape is then used to remove both geometric distortions and photometric (shading) effects in the image. We demonstrate our techniques on documents containing a variety of languages, fonts and sizes.
Keywords :
document image processing; image reconstruction; optical character recognition; text analysis; 2D distortion grid; 3D document deformation; 3D image reconstruction; 3D parallelogram mesh; curved document image; image rectification; optical character recognition system; shape-from-texture method; singular value decomposition; text document; text pattern regularity; Estimation; Image reconstruction; Optical character recognition software; Optical distortion; Shape; Surface reconstruction; Three dimensional displays;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4577-0394-2
Type :
conf
DOI :
10.1109/CVPR.2011.5995540
Filename :
5995540
Link To Document :
بازگشت