Title :
Toward island-of-reliability-driven very-large-vocabulary on-line handwriting recognition using character confidence scoring
Author :
Pitrelli, John E. ; Subrahmonia, Jayashree ; Maison, Benoit
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We explore a novel approach for handwriting recognition tasks whose intrinsic vocabularies are too large to be applied directly as constraints during recognition. Our approach makes use of vocabulary constraints, and addresses the issue that some parts of words may be written more recognizably than others. An initial pass is made with an HMM recognizer, without vocabulary constraints, generating a lattice of character-hypothesis arcs representing likely segmentations of the handwriting signal. Arc confidence scores are computed using a posteriori probabilities. The most confidently recognized characters are used to filter the overall vocabulary, generating a word subset manageable for constraining a second recognition pass. With a vocabulary of 273000 words, we can limit to 50000 words in the second pass and eliminate 39.3% of the word errors made by a one-pass recognizer without vocabulary constraints, and 18.3% of errors made using a fixed 30000-word set
Keywords :
character sets; handwriting recognition; hidden Markov models; HMM; character confidence scoring; character-hypothesis arc lattice; island-of-reliability-driven recognition; on-line handwriting recognition; signal segmentations; very large vocabulary recognition; vocabulary constraints; vocabulary filter; word subset; Character generation; Character recognition; Filters; Handwriting recognition; Hidden Markov models; Impedance; Lattices; Memory management; Signal generators; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941222