• DocumentCode
    1994915
  • Title

    Progress in camera-based document image analysis

  • Author

    Doermann, David ; Liang, Jian ; Li, Huiping

  • Author_Institution
    Language & Media Process. Lab., Maryland Univ., College Park, MD, USA
  • fYear
    2003
  • fDate
    3-6 Aug. 2003
  • Firstpage
    606
  • Abstract
    The increasing availability of high performance, low priced, portable digital imaging devices has created a tremendous opportunity for supplementing traditional scanning for document image acquisition. Digital cameras attached to cellular phones, PDAs, or as standalone still or video devices are highly mobile and easy to use; they can capture images of any kind of document including very thick books, historical pages too fragile to touch, and text in scenes; and they are much more versatile than desktop scanners. Should robust solutions to the analysis of documents captured with such devices become available, there is clearly a demand from many domains. Traditional scanner-based document analysis techniques provide us with a good reference and starting point, but they cannot be used directly on camera-captured images. Camera captured images can suffer from low resolution, blur, and perspective distortion, as well as complex layout and interaction of the content and background. In this paper we present a survey of application domains, technical challenges and solutions for recognizing documents captured by digital cameras. We begin by describing typical imaging devices and the imaging process. We discuss document analysis from a single camera-captured image as well as multiple frames and highlight some sample applications under development and feasible ideas for future development.
  • Keywords
    document image processing; image recognition; optical character recognition; camera-based document image analysis; camera-captured images; cellular phones; desktop scanners; digital cameras; digital imaging devices; document image acquisition; intelligent character recognition; optical character recognition; perspective distortion; scanner-based document analysis; video devices; Availability; Books; Cellular phones; Digital cameras; Digital images; High-resolution imaging; Image analysis; Layout; Personal digital assistants; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-1960-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2003.1227735
  • Filename
    1227735