DocumentCode
2229397
Title
A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognition
Author
Aubert, Xavier ; Beyerlein, Peter ; Ullrich, Meinhard
Author_Institution
Philips GmbH Forschungslab., Aachen, Germany
Volume
1
fYear
1996
fDate
3-6 Oct 1996
Firstpage
14
Abstract
Presents an extension of bottom-up state-tying towards improved handling of unseen triphones. As opposed to the usual backing-off to diphones and monophones, the current method aims at finding a triphone model that has proven to exhibit some similarity with the unseen triphone. It is based on a probabilistic mapping of unseen contexts to clusters of triphone states observed in the training data. This algorithm has been applied to dictation tasks for three languages with vocabulary sizes ranging from 20k to 64k. The results compare favorably with those obtained using standard back-off rules. This technique also offers an alternative to top-down decision-tree procedures which are frequently used, especially for their generalization capabilities
Keywords
dictation; hidden Markov models; probability; speech recognition; vocabulary; back-off rules; backing-off; bottom-up state-tying; dictation tasks; generalization capabilities; languages; large-vocabulary continuous speech recognition; probabilistic mapping; similarity; top-down decision-tree procedures; training data; triphone state clusters; unseen triphones; vocabulary size; Buildings; Clustering algorithms; Context modeling; Databases; Decision trees; Decoding; Laboratories; Speech recognition; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.606918
Filename
606918
Link To Document