Title :
Lexicon reduction in an framework based on quantized feature vectors
Author :
Kaufmann, G. ; Bunke, H. ; Hadorn, M.
Author_Institution :
Inst. fur Inf. und Angewandte Math., Berne Univ., Switzerland
Abstract :
For many applications in cursive script recognition the vocabulary is restricted to a small and fixed set of words. In a recognition approach such as Hidden Markov models, for each of these words a model is constructed and trained. In the recognition task an unknown pattern is matched with all of these models to find the most likely class. In this paper, we describe a method for reducing the size of vocabulary depending on the actual input. In contrast to many other techniques, our approach does not use any topological features. The reduction system is directly based on the quantized feature vectors which are used as input for the HMMs. Thus very little additional work is required for lexicon reduction. The proposed approach was successfully tested on two different systems with small lexicons. In both cases the lexicon could be reduced to 25% of its original size without increasing the error rate
Keywords :
feature extraction; hidden Markov models; optical character recognition; cursive script recognition; framework; hidden Markov models; lexicon reduction; quantized feature vectors; reduction system; vocabulary; vocabulary depending approach; Error analysis; Feature extraction; Hidden Markov models; Pattern matching; Pattern recognition; System testing; Text recognition; Vocabulary;
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
DOI :
10.1109/ICDAR.1997.620678