Title :
Acoustic-phonetic modeling in the SPICOS system
Author :
Ney, Hermann ; Noll, Andreas
Author_Institution :
Lehrstuhl fuer Inf. VI, Rheinisch-Westfalische Tech. Hochschule, Aachen, Germany
fDate :
4/1/1994 12:00:00 AM
Abstract :
This paper deals with the acoustic-phonetic modeling developed in the SPICOS recognition system. The phoneme units are represented by variants of hidden Markov models and are refined step-by-step to improve the recognition performance. Two different approaches to modeling the emission probabilities are investigated, namely discrete models and continuous mixture densities. For the mixture density approach, a straightforward integration into the Viterbi scoring criterion is derived. In the experimental tests, the continuous mixture density approach was found to be superior to the discrete models, given the constraints of the Viterbi framework. The authors describe the details and refinements of the two approaches
Keywords :
hidden Markov models; probability; speech recognition equipment; SPICOS recognition system; Viterbi scoring criterion; acoustic-phonetic modeling; continuous mixture densities; discrete models; emission probabilities; hidden Markov models; phoneme units; recognition performance; Acoustic testing; Context modeling; Gaussian distribution; Hidden Markov models; Robustness; Speech recognition; Terrorism; Training data; Viterbi algorithm; Vocabulary;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on