• DocumentCode
    1993722
  • Title

    Confidence measures for an address reading system

  • Author

    Brakensiek, Anja ; Rottland, Jörg ; Rigoll, Gerhard

  • Author_Institution
    Univ. Duisburg, Germany
  • fYear
    2003
  • fDate
    3-6 Aug. 2003
  • Firstpage
    294
  • Abstract
    In this paper the performance of different confidence measures used for an address recognition system are evaluated. The recognition system for cursive handwritten German address words is based on hidden Markov models (HMMs). It is essential that the structure of the address (name, street, city, country) is known, so that a specific small but complete dictionary can be selected. Upon choosing a wrong dictionary (OOV: out-of-vocabulary) or misrecognizing a word, the recognition result should be rejected by means of the confidence measure. This paper points out two aspects: the comparison of four confidence measures for single words - based on the likelihood, a garbage-model, a two-best recognition or a character decoding - and the comparison of using complete or wrong dictionaries. It is shown that the best confidence measure - the two-best distance - has a quite different behavior using OOV.
  • Keywords
    decoding; dictionaries; document image processing; handwritten character recognition; hidden Markov models; image coding; postal services; address reading system; address recognition system; automatic recognition system; character decoding; confidence measures; cursive handwritten German address words; dictionary; frame normalized likelihood; garbage-model based confidence; hidden Markov models; out-of-vocabulary word; postal automation; two-best distance; two-best recognition; Automation; Character recognition; Cities and towns; Decoding; Dictionaries; Error analysis; Handwriting recognition; Hidden Markov models; Neural networks; Personal digital assistants;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-1960-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2003.1227676
  • Filename
    1227676