Title :
Experiments on mixture-density phoneme-modelling for the speaker-independent 1000-word speech recognition DARPA task
Author_Institution :
Philips GmbH Forschunglaboratorium, Hamburg, West Germany
Abstract :
The modifications and improvements of the acoustic recognition component of the SPICOS system for the DARPA naval resource management task are described. These modifications and improvements include: the modeling of the continuous mixture densities of the acoustic vectors, the choice of suitable context-dependent phoneme units and the construction of generalized context phoneme units, and the modeling of transitional information in the acoustic vector. The experimental results show that critical factors are the acoustic resolution of the probability distributions and the context information captured in the acoustic vectors. By these enhancements, the system was able to attain a word error rate of 23.6% and 26.5% on two test sets in speaker-independent recognition mode, when trained on 80 speakers. The word pair grammar reduced the word error rate to 7.1% and 9.3% respectively
Keywords :
military computing; probability; speech recognition; DARPA; SPICOS; acoustic vectors; context information; context-dependent phoneme units; mixture-density phoneme-modelling; naval resource management task; probability distributions; speaker independent speech recognition; word error rate; Acoustic testing; Context modeling; Error analysis; Hidden Markov models; Loudspeakers; Probability distribution; Resource management; Speech recognition; System testing; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115878