مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework

DocumentCode :

454729

Title :

Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework

Author :

Xu, Haitian ; Tan, Zheng-Hua ; Dalsgaard, Paul ; Lindberg, Børge

Author_Institution :

Center for TeleInFrastructure, Aalborg Univ.

Volume :

fYear :

2006

fDate :

14-19 May 2006

Abstract :

Compared to multi-condition training (MTR), condition-dependent training generates multiple acoustic hidden Markov model sets each identified by a noisy environment and is known to perform substantially better for known noise types (included in training) while worse for unknown (untrained) noise types. This paper attempts to bridge the performance gap between known and unknown noise types by introducing a minimum mean-square error (MMSE) noise-type based compensation algorithm. On the basis of a modified vector Taylor series and the measurement of feature reliability as well as noise similarity, the MMSE estimation adapts the test features corrupted by the unknown noise type to the corresponding features corrupted by the known noise type. This method significantly improves the recognition performance for unknown noise types while maintaining the good performance for known noise types. Furthermore, in order to benefit directly from MTR, a model interpolation strategy is investigated which combines the MTR and the condition-dependent model sets. Both good performance and low computational cost are achieved by only interpolating the mixtures of each condition-dependent model state with the least weighted mixture in the corresponding MTR model state. The overall system gives promising results

Keywords :

interpolation; least mean squares methods; series (mathematics); speech recognition; MMSE; condition-dependent model sets; minimum mean-square error; model interpolation; modified vector Taylor series; multicondition training; noise-type based feature compensation; robust speech recognition; Acoustic noise; Bridges; Hidden Markov models; Interpolation; Noise generators; Noise measurement; Noise robustness; Speech recognition; Taylor series; Working environment noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location :

Toulouse

ISSN :

1520-6149

Print_ISBN :

1-4244-0469-X

Type :

conf

DOI :

10.1109/ICASSP.2006.1660227

Filename :

1660227

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=454729