DocumentCode :
454729
Title :
Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework
Author :
Xu, Haitian ; Tan, Zheng-Hua ; Dalsgaard, Paul ; Lindberg, Børge
Author_Institution :
Center for TeleInFrastructure, Aalborg Univ.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
Compared to multi-condition training (MTR), condition-dependent training generates multiple acoustic hidden Markov model sets each identified by a noisy environment and is known to perform substantially better for known noise types (included in training) while worse for unknown (untrained) noise types. This paper attempts to bridge the performance gap between known and unknown noise types by introducing a minimum mean-square error (MMSE) noise-type based compensation algorithm. On the basis of a modified vector Taylor series and the measurement of feature reliability as well as noise similarity, the MMSE estimation adapts the test features corrupted by the unknown noise type to the corresponding features corrupted by the known noise type. This method significantly improves the recognition performance for unknown noise types while maintaining the good performance for known noise types. Furthermore, in order to benefit directly from MTR, a model interpolation strategy is investigated which combines the MTR and the condition-dependent model sets. Both good performance and low computational cost are achieved by only interpolating the mixtures of each condition-dependent model state with the least weighted mixture in the corresponding MTR model state. The overall system gives promising results
Keywords :
interpolation; least mean squares methods; series (mathematics); speech recognition; MMSE; condition-dependent model sets; minimum mean-square error; model interpolation; modified vector Taylor series; multicondition training; noise-type based feature compensation; robust speech recognition; Acoustic noise; Bridges; Hidden Markov models; Interpolation; Noise generators; Noise measurement; Noise robustness; Speech recognition; Taylor series; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660227
Filename :
1660227
Link To Document :
بازگشت