• DocumentCode
    1636301
  • Title

    Handling Out-of-Vocabulary Words and Recognition Errors Based on Word Linguistic Context for Handwritten Sentence Recognition

  • Author

    Quiniou, Solen ; Cheriet, Mohamed ; Anquetil, Eric

  • Author_Institution
    Synchromdia Lab., Ecole de Technol. Super., Montreal, QC, Canada
  • fYear
    2009
  • Firstpage
    466
  • Lastpage
    470
  • Abstract
    In this paper we investigate the use of linguistic information given by language models to deal with word recognition errors on handwritten sentences. We focus especially on errors due to out-of-vocabulary (OOV) words. First, word posterior probabilities are computed and used to detect error hypotheses on output sentences. An SVM classifier allows these errors to be categorized according to defined types. Then, a post-processing step is performed using a language model based on part-of-speech (POS) tags which is combined to the n-gram model previously used. Thus, error hypotheses can be further recognized and POS tags can be assigned to the OOV words. Experiments on on-line handwritten sentences show that the proposed approach allows a significant reduction of the word error rate.
  • Keywords
    computational linguistics; error statistics; handwritten character recognition; image classification; natural language processing; support vector machines; text analysis; OOV word; POS tag; SVM classifier; error hypotheses detection; handwritten text recognition system; language model; n-gram model; online handwritten sentence recognition system; out-of-vocabulary word; part-of-speech tag; post-processing step; recognition error rate reduction; word linguistic context; word posterior probability; Context modeling; Document handling; Error correction; Handwriting recognition; Information analysis; Laboratories; Speech recognition; Text analysis; Text recognition; Vocabulary; Handwritten sentence recognition; error detection; language models; out-of-vocabulary words; part-of-speech tags;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.78
  • Filename
    5277628