DocumentCode
3020676
Title
PerfectDoc: a ground truthing environment for complex documents
Author
Yacoub, Sherif ; Saxena, Vinay ; Sami, Sayeed Nusrulla
Author_Institution
HP Labs., Barcelona, Spain
fYear
2005
fDate
29 Aug.-1 Sept. 2005
Firstpage
452
Abstract
In this paper, we present PerfectDoc; a ground truthing and document correction tool. The tool provides post processing correction capabilities that are required after complex document analysis and understanding tasks. The tool has the advantage of being comprehensive (integration of most common correction tasks), easy to use (minimal clicks for corrections), configurable (can be used for different types of documents), and provides separate correction views. We used the tool to correct the output from a document understanding system used to extract articles from 80-years archive of Time weekly magazine.
Keywords
document handling; PerfectDoc ground truthing environment; Time weekly magazine archive; complex document analysis; document correction tool; document understanding system; Algorithm design and analysis; Data mining; Graphical user interfaces; Information analysis; Joining processes; Labeling; Optical character recognition software; Paper technology; Performance analysis; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN
1520-5263
Print_ISBN
0-7695-2420-6
Type
conf
DOI
10.1109/ICDAR.2005.187
Filename
1575587
Link To Document