DocumentCode
177463
Title
An ideal hidden-activation mask for deep neural networks based noise-robust speech recognition
Author
Bo Li ; Khe Chai Sim
Author_Institution
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
fYear
2014
fDate
4-9 May 2014
Firstpage
200
Lastpage
204
Abstract
Deep neural networks (DNNs) are capable of modeling large acoustic variations. However, the performance on noisy data is still below humans´ expectations. In this work, we present an ideal hidden-activation masking (IHM) approach to improve their noise robustness. This IHM is inspired by the existing spectral masking techniques. Instead of masking away the noise-dominant components in the spectral domain, we propose to discard DNNs´ inconsistent hidden activations. The IHM is computed from the parallel data to identify hidden units that are immune to environment noise. DNNs then utilize it to improve their prediction robustness with the noise-invariant activations. Experimental results on the Aurora4 task have shown that the proposed IHM is both effective in reducing noise variations and robust to mask estimation errors.
Keywords
neural nets; speech intelligibility; speech recognition; Aurora4 task; DNN; IHM; deep neural networks; ideal hidden-activation masking approach; noise-dominant components; noise-invariant activations; noise-robust speech recognition; spectral masking techniques; Neural networks; Noise; Noise measurement; Noise robustness; Robustness; Speech; Speech recognition; Deep Neural Networks; Noise Robustness;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6853586
Filename
6853586
Link To Document