DocumentCode :
3014477
Title :
Multi-style training for robust isolated-word speech recognition
Author :
Lippman, Richard P. ; Martin, Edward A. ; Paul, Douglas B.
Author_Institution :
Massachusetts Institute of Technology, Lexington, Massachusetts
Volume :
12
fYear :
1987
fDate :
31868
Firstpage :
705
Lastpage :
708
Abstract :
A new training procedure called multi-style training has been developed to improve performance when a recognizer is used under stress or in high noise but cannot be trained in these conditions. Instead of speaking normally during training, talkers use different, easily produced, talking styles. This technique was tested using a speech data base that included stress speech produced during a workload task and when intense noise was presented through earphones. A continuous-distribution talker-dependent Hidden Markov Model (HMM) recognizer was trained both normally (5 normally spoken tokens) and with multi-style training (one token each from normal, fast, clear, loud, and question-pitch talking styles). The average error rate under stress and normal conditions fell by more than a factor of two with multi-style training and the average error rate under conditions sampled during training fell by a factor of four.
Keywords :
Cepstral analysis; Degradation; Error analysis; Hidden Markov models; Noise shaping; Robustness; Speech enhancement; Speech recognition; Stress; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type :
conf
DOI :
10.1109/ICASSP.1987.1169544
Filename :
1169544
Link To Document :
بازگشت