Title :
Handwritten address recognition with open vocabulary using character n-grams
Author :
Brakensiek, Anja ; Rottland, Jörg ; Rigoll, Gerhard
Author_Institution :
Dept. of Comput. Sci., Gerhard Mercator Univ., Duisburg, Germany
Abstract :
In this paper a recognition system, based on tied-mixture hidden Markov models, for handwritten address words is described, which makes use of a language model that consists of backoff character n-grams. For a dictionary-based recognition system it is essential that the structure of the address (name, street, city) is known. If the single parts of the address cannot be categorized, the used vocabulary is unknown and thus unlimited. The performance of this open vocabulary recognition using n-grams is compared to the use of dictionaries of different sizes. Especially, the confidence of recognition results and the possibility of a useful post-processing are significant advantages of language models.
Keywords :
dictionaries; feature extraction; handwritten character recognition; hidden Markov models; postal services; visual databases; character n-grams; database; dictionary-based recognition; feature extraction; handwritten address recognition; hidden Markov models; language model; open vocabulary; postal automation; Automation; Character recognition; Cities and towns; Dictionaries; Handwriting recognition; Hidden Markov models; Postal services; Streaming media; Vocabulary; Writing;
Conference_Titel :
Frontiers in Handwriting Recognition, 2002. Proceedings. Eighth International Workshop on
Print_ISBN :
0-7695-1692-0
DOI :
10.1109/IWFHR.2002.1030936