• DocumentCode
    2630270
  • Title

    Interpreting word recognition decisions with a document database graph

  • Author

    Hull, Jonathan J. ; Li, Yanhong

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York, Buffalo, NY, USA
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    488
  • Lastpage
    492
  • Abstract
    A method is presented to filter the output of a word recognition algorithm, which may contain errors, to locate decisions that should be correct with a high degree of certainty. The algorithm uses the output of a word recognition system and techniques used in information retrieval to characterize a free-text document database to locate a set of documents that have topics which are similar to that of the input document. The vocabulary from these similar documents is then used to locate the correct word recognition decisions. Experimental results show that a subset of the word recognition decisions for an input document can be located that are between 90 and 99% correct. The subset located by this method can be used to drive other recognition processes applied to the rest of the text
  • Keywords
    database management systems; document handling; optical character recognition; word processing; document database graph; free-text document database; information retrieval; input document; vocabulary; word recognition algorithm; word recognition decisions; Character recognition; Dictionaries; Filters; Image databases; Image recognition; Information retrieval; Text analysis; Text recognition; Visual databases; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395689
  • Filename
    395689