• DocumentCode
    1300678
  • Title

    Does the Cost Function Matter in Bayes Decision Rule?

  • Author

    Schlueter, R. ; Nussbaum-Thom, Markus ; Ney, Hermann

  • Author_Institution
    Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
  • Volume
    34
  • Issue
    2
  • fYear
    2012
  • Firstpage
    292
  • Lastpage
    301
  • Abstract
    In many tasks in pattern recognition, such as automatic speech recognition (ASR), optical character recognition (OCR), part-of-speech (POS) tagging, and other string recognition tasks, we are faced with a well-known inconsistency: The Bayes decision rule is usually used to minimize string (symbol sequence) error, whereas, in practice, we want to minimize symbol (word, character, tag, etc.) error. When comparing different recognition systems, we do indeed use symbol error rate as an evaluation measure. The topic of this work is to analyze the relation between string (i.e., 0-1) and symbol error (i.e., metric, integer valued) cost functions in the Bayes decision rule, for which fundamental analytic results are derived. Simple conditions are derived for which the Bayes decision rule with integer-valued metric cost function and with 0-1 cost gives the same decisions or leads to classes with limited cost. The corresponding conditions can be tested with complexity linear in the number of classes. The results obtained do not make any assumption w.r.t. the structure of the underlying distributions or the classification problem. Nevertheless, the general analytic results are analyzed via simulations of string recognition problems with Levenshtein (edit) distance cost function. The results support earlier findings that considerable improvements are to be expected when initial error rates are high.
  • Keywords
    Bayes methods; pattern recognition; ASR; Bayes decision rule; OCR; POS; automatic speech recognition; distance cost function; error rate symbol; optical character recognition; part-of-speech; pattern recognition; string recognition; symbol sequence; Bayesian methods; Cost function; Error analysis; Measurement uncertainty; Speech recognition; Statistical analysis; Bayes decision rule; Statistical pattern recognition; classifier design and evaluation; cost/loss function.; Algorithms; Bayes Theorem; Computer Simulation; Pattern Recognition, Automated; Speech Recognition Software;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2011.163
  • Filename
    5989822