• DocumentCode
    2520878
  • Title

    Handling artifacts in digitally reproduced documents

  • Author

    Cinque, L. ; Levialdi, S. ; Lombardi, L. ; Tanimoto, S.

  • Author_Institution
    Dept. of Inf. Sci., Rome Univ., Italy
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    340
  • Lastpage
    346
  • Abstract
    The analysis of scanned documents is important in the construction of digital libraries and paperless offices. One significant challenge is coping with artifacts of photocopying and scanning. We present a series of simple techniques for handling these difficulties. Using 125 images of the University of Washington scanned documents database, we demonstrate the effectiveness of these methods in preparing the images for segmentation by a multiresolution algorithm
  • Keywords
    digital libraries; document image processing; image segmentation; artifacts; digital libraries; multiresolution algorithm; paperless offices; photocopying; scanned documents; scanned documents database; scanning; segmentation; Document handling; Image databases; Image resolution; Image segmentation; Information analysis; Information science; Printing; Remuneration; Software libraries; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architectures for Machine Perception, 2000. Proceedings. Fifth IEEE International Workshop on
  • Conference_Location
    Padova
  • Print_ISBN
    0-7695-0740-9
  • Type

    conf

  • DOI
    10.1109/CAMP.2000.875993
  • Filename
    875993