  • DocumentCode
    762420
  • Title
    Restoring warped document images through 3D shape modeling
  • Author
    Tan, Chew Lim ; Zhang, Li ; Zhang, Zheng ; Xia, Tao
  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore
  • Volume
    28
  • Issue
    2
  • fYear
    2006
  • Firstpage
    195
  • Lastpage
    208
  • Abstract
    Scanning a document page from a thick bound volume often results in two kinds of distortions in the scanned image, i.e., shade along the "spine" of the book and warping in the shade area. In this paper, we propose an efficient restoration method based on the discovery of the 3D shape of a book surface from the shading information in a scanned document image. From a technical point of view, this shape from shading (SFS) problem in real-world environments is characterized by 1) a proximal and moving light source, 2) Lambertian reflection, 3) nonuniform albedo distribution, and 4) document skew. Taking all these factors into account, we first build practical models (consisting of a 3D geometric model and a 3D optical model) for the practical scanning conditions to reconstruct the 3D shape of the book surface. We next restore the scanned document image using this shape based on deshading and dewarping models. Finally, we evaluate the restoration results by comparing our estimated surface shape with the real shape as well as the OCR performance on original and restored document images. The results show that the geometric and photometric distortions are mostly removed and the OCR results are improved markedly.
  • Keywords
    document image processing; image restoration; optical character recognition; 3D shape modeling; OCR performance; deshading model; dewarping model; photometric distortions; warped document images; Books; Geometrical optics; Image restoration; Light sources; Optical character recognition software; Optical distortion; Optical reflection; Shape; Solid modeling; Surface reconstruction; Document image restoration; OCR improvement; document image analysis; image distortion; image warping; shape from shading; Algorithms; Artifacts; Artificial Intelligence; Automatic Data Processing; Computer Graphics; Computer Simulation; Documentation; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Information Storage and Retrieval; Models, Theoretical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0162-8828
  • Type
    jour
  • DOI
    10.1109/TPAMI.2006.40
  • Filename
    1561180