DocumentCode :
1370762
Title :
Robust continuous speech recognition using parallel model combination
Author :
Gales, Mark J F ; Young, Steve J.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Volume :
4
Issue :
5
fYear :
1996
fDate :
9/1/1996 12:00:00 AM
Firstpage :
352
Lastpage :
359
Abstract :
This paper addresses the problem of automatic speech recognition in the presence of interfering noise. It focuses on the parallel model combination (PMC) scheme, which has been shown to be a powerful technique for achieving noise robustness. Most experiments reported on PMC to date have been on small, 10-50 word vocabulary systems. Experiments on the Resource Management (RM) database, a 1000 word continuous speech recognition task, reveal compensation requirements not highlighted by the smaller vocabulary tasks. In particular, that it is necessary to compensate the dynamic parameters as well as the static parameters to achieve good recognition performance. The database used for these experiments was the RM speaker independent task with either Lynx Helicopter noise or Operation Room noise from the NOISEX-92 database added. The experiments reported here used the HTK RM recognizer developed at CUED modified to include PMC based compensation for the static, delta and delta-delta parameters. After training on clean speech data, the performance of the recognizer was found to be severely degraded when noise was added to the speech signal at between 10 and 18 dB. However, using PMC the performance was restored to a level comparable with that obtained when training directly in the noise corrupted environment
Keywords :
acoustic noise; compensation; interference suppression; speech recognition; CUED; HTK RM recognizer; Lynx Helicopter noise; NOISEX-92 database; Operation Room noise; PMC; Resource Management database; automatic speech recognition; delta parameter; delta-delta parameter; dynamic parameters; interfering noise; noise robustness; parallel model combination; robust continuous speech recognition; speech signal; static parameters; Automatic speech recognition; Databases; Helicopters; Noise robustness; Power system modeling; Resource management; Speech enhancement; Speech recognition; Vocabulary; Working environment noise;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.536929
Filename :
536929
Link To Document :
بازگشت