DocumentCode
304812
Title
Document image decoding in the Berkeley Digital Library
Author
Kopec, Gary E.
Author_Institution
Xerox Palo Alto Res. Center, CA, USA
Volume
1
fYear
1996
fDate
16-19 Sep 1996
Firstpage
769
Abstract
The UC Berkeley Environmental Digital Library Project is one of six university led projects that were initiated in the fall of 1994 as part of a four year digital library initiative sponsored by the NSF, NASA and ARPA. The Berkeley project is particularly interesting from an image processing perspective because its testbed collection consists almost entirely of scanned materials. As a result, the Berkeley project is making extensive use of document recognition and other image analysis technology to provide content based access to the collection. The Document Image Decoding (DID) group at Xerox PARC is a member of the Berkeley team and is investigating the application of DID techniques to providing high quality (accurate and properly structured) transcriptions of scanned documents. The paper briefly describes the Berkeley project, discusses some of its recognition requirements and presents an example of an advanced structured document created using DID technology
Keywords
document image processing; image coding; image recognition; library automation; DID technology; UC Berkeley Environmental Digital Library Project; advanced structured document; content based access; digital library initiative; document image decoding; document recognition; image analysis technology; image processing; recognition requirements; scanned materials; testbed collection; university led projects; Decoding; Error analysis; Hyperspectral imaging; Image processing; Image retrieval; Indexes; NASA; Optical character recognition software; Software libraries; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing, 1996. Proceedings., International Conference on
Conference_Location
Lausanne
Print_ISBN
0-7803-3259-8
Type
conf
DOI
10.1109/ICIP.1996.561014
Filename
561014
Link To Document