DocumentCode
1189020
Title
Automated quality assurance for document understanding systems
Author
Yacoub, Sherif
Volume
20
Issue
3
fYear
2003
Firstpage
76
Lastpage
82
Abstract
To process high-volume input data, such as the scanned images of publishers´ book and journal collections, content understanding systems should run automatically, continuously, and without human attendance. Ensuring the output quality of such systems is a challenging task, however, and automated quality assurance techniques are thus essential to its success. The author discusses three automated QA techniques that were developed for Hewlett-Packard´s Digital Content ReMastering system.
Keywords
document image processing; quality control; text analysis; Digital Content ReMastering system; Hewlett-Packard; automated quality assurance; book collections; content understanding systems; document understanding systems; journal collections; Computer architecture; Hardware; Humans; Material storage; Network servers; Optical character recognition software; Quality assurance; Switches; Workstations; XML;
fLanguage
English
Journal_Title
Software, IEEE
Publisher
ieee
ISSN
0740-7459
Type
jour
DOI
10.1109/MS.2003.1196325
Filename
1196325
Link To Document