Title :
PerfectDoc: a ground truthing environment for complex documents
Author :
Yacoub, Sherif ; Saxena, Vinay ; Sami, Sayeed Nusrulla
Author_Institution :
HP Labs., Barcelona, Spain
fDate :
29 Aug.-1 Sept. 2005
Abstract :
In this paper, we present PerfectDoc, a ground truthing and document correction tool. The tool provides the post-processing correction capabilities that are required after complex document analysis and understanding tasks. It is comprehensive (it integrates the most common correction tasks), easy to use (corrections take minimal clicks), and configurable (it can be used for different types of documents), and it provides separate correction views. We used the tool to correct the output of a document understanding system that extracts articles from an 80-year archive of Time weekly magazine.
Keywords :
document handling; PerfectDoc ground truthing environment; Time weekly magazine archive; complex document analysis; document correction tool; document understanding system; Algorithm design and analysis; Data mining; Graphical user interfaces; Information analysis; Joining processes; Labeling; Optical character recognition software; Paper technology; Performance analysis; Text analysis
Conference_Title :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
Print_ISBN :
0-7695-2420-6
DOI :
10.1109/ICDAR.2005.187