• DocumentCode
    858841
  • Title

    Composition of a Dewarped and Enhanced Document Image From Two View Images

  • Author

    Koo, Hyung Il ; Kim, Jinho ; Cho, Nam Ik

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Seoul Nat. Univ., Seoul
  • Volume
    18
  • Issue
    7
  • fYear
    2009
  • fDate
    7/1/2009 12:00:00 AM
  • Firstpage
    1551
  • Lastpage
    1562
  • Abstract
    In this paper, we propose an algorithm to compose a geometrically dewarped and visually enhanced image from two document images taken by a digital camera at different angles. Unlike conventional methods that require special equipment, assumptions about the contents of books, or complicated image acquisition steps, we estimate the unfolded book or document surface from corresponding points between the two images. For this purpose, the surface and camera matrices are estimated using structure reconstruction, 3-D projection analysis, and random sample consensus (RANSAC)-based curve fitting with a cylindrical surface model. Because we make no assumptions about the contents of books, the proposed method can be applied not only to optical character recognition (OCR) but also to the high-quality digitization of pictures in documents. In addition to dewarping, which yields a structurally better image, image mosaicking is performed to further improve visual quality. By selecting the better parts of the images (those with less out-of-focus blur and/or without specular reflections) from either view, we compose a better image by stitching and blending them. These processes are formulated as energy minimization problems that can be solved using a graph cut method. Experiments on many kinds of book and document images show that the proposed algorithm works robustly and yields visually pleasing results. In addition, the OCR rate of the resulting image is comparable to that of document images from a flatbed scanner.
  • Keywords
    curve fitting; document image processing; image enhancement; image reconstruction; image sampling; optical character recognition; 3D projection analysis; OCR rate; cylindrical surface model; digital camera; document image enhancement; energy minimization problems; flatbed scanner; geometric dewarping; graph cut method; image acquisition; image mosaic; random sample consensus-based curve fitting; structure reconstruction; document dewarping; document image stitching; robust surface estimation; specular reflection removal
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2009.2019301
  • Filename
    4916075
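
The abstract mentions random sample consensus-based curve fitting with a cylindrical surface model. The sketch below is only an illustration of that general idea and is not taken from the paper: it assumes the cross-section of the page surface can be treated as a polynomial curve z = f(x), fits that curve with a plain RANSAC loop, and then refits on the inliers. The function names, the polynomial degree, the iteration count, and the inlier tolerance are all hypothetical choices made for this example.

```python
import random


def solve_linear(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def polyfit(xs, zs, degree):
    """Least-squares polynomial fit via the normal equations.

    Returns coefficients ordered from the constant term upward.
    """
    n = degree + 1
    ata = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    atz = [sum((x ** i) * z for x, z in zip(xs, zs)) for i in range(n)]
    return solve_linear(ata, atz)


def polyval(coeffs, x):
    """Evaluate a polynomial with coefficients ordered constant-first."""
    return sum(c * x ** i for i, c in enumerate(coeffs))


def ransac_polyfit(xs, zs, degree=2, n_iters=200, tol=0.05, seed=0):
    """Fit z = f(x) robustly: sample minimal sets, keep the largest consensus.

    All defaults are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    best = []
    for _ in range(n_iters):
        sample = rng.sample(range(len(xs)), degree + 1)
        try:
            coeffs = polyfit([xs[i] for i in sample],
                             [zs[i] for i in sample], degree)
        except ZeroDivisionError:
            continue  # degenerate sample (e.g. repeated x values)
        inliers = [i for i in range(len(xs))
                   if abs(polyval(coeffs, xs[i]) - zs[i]) < tol]
        if len(inliers) > len(best):
            best = inliers
    # Final least-squares refit using only the consensus set.
    coeffs = polyfit([xs[i] for i in best], [zs[i] for i in best], degree)
    return coeffs, best
```

In a two-view setting, the input points would presumably come from triangulated correspondences projected onto the plane perpendicular to the cylinder axis; that projection step is outside the scope of this sketch.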