• DocumentCode
    304812
  • Title

    Document image decoding in the Berkeley Digital Library

  • Author

    Kopec, Gary E.

  • Author_Institution
    Xerox Palo Alto Res. Center, CA, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    16-19 Sep 1996
  • Firstpage
    769
  • Abstract
    The UC Berkeley Environmental Digital Library Project is one of six university led projects that were initiated in the fall of 1994 as part of a four year digital library initiative sponsored by the NSF, NASA and ARPA. The Berkeley project is particularly interesting from an image processing perspective because its testbed collection consists almost entirely of scanned materials. As a result, the Berkeley project is making extensive use of document recognition and other image analysis technology to provide content based access to the collection. The Document Image Decoding (DID) group at Xerox PARC is a member of the Berkeley team and is investigating the application of DID techniques to providing high quality (accurate and properly structured) transcriptions of scanned documents. The paper briefly describes the Berkeley project, discusses some of its recognition requirements and presents an example of an advanced structured document created using DID technology
  • Keywords
    document image processing; image coding; image recognition; library automation; DID technology; UC Berkeley Environmental Digital Library Project; advanced structured document; content based access; digital library initiative; document image decoding; document recognition; image analysis technology; image processing; recognition requirements; scanned materials; testbed collection; university led projects; Decoding; Error analysis; Hyperspectral imaging; Image processing; Image retrieval; Indexes; NASA; Optical character recognition software; Software libraries; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 1996. Proceedings., International Conference on
  • Conference_Location
    Lausanne
  • Print_ISBN
    0-7803-3259-8
  • Type

    conf

  • DOI
    10.1109/ICIP.1996.561014
  • Filename
    561014