Title :
Speech recognition using temporally connected kernels in mixture density hidden Markov models
Author_Institution :
Neural Networks Res. Centre, Helsinki Univ. of Technol., Espoo, Finland
Abstract :
A method is presented for speeding up an HMM-based speech recognition system in which the states are modeled by a large number of Gaussian kernels. The emission probabilities of the states are usually dominated by the Gaussians nearest to the input vector. The speedup is gained without deteriorating the recognition accuracy by concentrating on these kernels in a reduced K-best-kernel search. In this work, the time information of the input is encoded in the connections between the kernels. The search for the dominating kernels is then performed along the kernel connections, which model the trajectories of the speech in the feature space. In the experiments, speaker-dependent speech recognizers were trained for ten speakers. The number of distance computations between feature vectors and kernel mean vectors was reduced by 75% without increasing the average phoneme recognition error, which was 5.7% for the baseline system.
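The reduced search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `links` adjacency table, and the equal-weight, unit-variance kernels are all assumptions made for illustration, and the way the connections are learned from training data is not shown. The idea sketched here is that the candidates for the current frame are limited to the best kernels of the previous frame plus the kernels connected to them, so distances are computed for only a small subset of kernel means.

```python
import math


def sq_dist(x, mean):
    """Squared Euclidean distance between a feature vector and a kernel mean."""
    return sum((a - b) ** 2 for a, b in zip(x, mean))


def k_best_full(x, means, k):
    """Baseline: compute the distance to every kernel mean, keep the k nearest.

    Returns (indices of the k nearest kernels, number of distances computed).
    """
    scored = sorted((sq_dist(x, m), i) for i, m in enumerate(means))
    return [i for _, i in scored[:k]], len(means)


def k_best_connected(x, means, links, prev_best, k):
    """Reduced search: only kernels linked to the previous frame's best ones.

    `links` is a hypothetical adjacency table (kernel index -> list of
    connected kernel indices); how it is built is assumed, not shown.
    """
    candidates = set(prev_best)
    for i in prev_best:
        candidates.update(links[i])
    scored = sorted((sq_dist(x, means[i]), i) for i in candidates)
    return [i for _, i in scored[:k]], len(candidates)


def log_emission(x, means, best):
    """Approximate log emission score from the selected kernels only.

    Assumes equal mixture weights and unit-variance spherical Gaussians;
    normalisation constants are omitted. Uses log-sum-exp for stability.
    """
    terms = [-0.5 * sq_dist(x, means[i]) for i in best]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))
```

In use, the first frame of an utterance would go through `k_best_full`, and each later frame would pass the previous result as `prev_best` to `k_best_connected`. The reduced search is an approximation: a kernel that is near the input but not reachable through the connections is missed, so the quality of the connections determines how much accuracy is preserved.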
Keywords :
Gaussian distribution; hidden Markov models; speech recognition; Gaussian kernels; HMM based speech recognition system; PDF; average phoneme recognition error; emission probabilities; experiments; feature space; feature vectors; kernel mean vectors; mixture density hidden Markov models; performance; recognition accuracy; reduced K-best-kernel search; speaker-dependent speech recognizers; speech trajectories; temporally connected kernels; time information; Artificial neural networks; Hidden Markov models; Intelligent networks; Kernel; Neural networks; Pattern recognition; Probability density function; Self organizing feature maps; Speech recognition; Unsupervised learning;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.860139