• DocumentCode
    3020676
  • Title

    PerfectDoc: a ground truthing environment for complex documents

  • Author

    Yacoub, Sherif ; Saxena, Vinay ; Sami, Sayeed Nusrulla

  • Author_Institution
    HP Labs., Barcelona, Spain
  • fYear
    2005
  • fDate
    29 Aug.-1 Sept. 2005
  • Firstpage
    452
  • Abstract
    In this paper, we present PerfectDoc; a ground truthing and document correction tool. The tool provides post processing correction capabilities that are required after complex document analysis and understanding tasks. The tool has the advantage of being comprehensive (integration of most common correction tasks), easy to use (minimal clicks for corrections), configurable (can be used for different types of documents), and provides separate correction views. We used the tool to correct the output from a document understanding system used to extract articles from 80-years archive of Time weekly magazine.
  • Keywords
    document handling; PerfectDoc ground truthing environment; Time weekly magazine archive; complex document analysis; document correction tool; document understanding system; Algorithm design and analysis; Data mining; Graphical user interfaces; Information analysis; Joining processes; Labeling; Optical character recognition software; Paper technology; Performance analysis; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
  • ISSN
    1520-5263
  • Print_ISBN
    0-7695-2420-6
  • Type

    conf

  • DOI
    10.1109/ICDAR.2005.187
  • Filename
    1575587