Title :
Speech Separation based on signal-noise-dependent deep neural networks for robust speech recognition
Author :
Yan-Hui Tu ; Jun Du ; Li-Rong Dai ; Chin-Hui Lee
Author_Institution :
Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
In this paper, we propose a new signal-noise-dependent (SND) deep neural network (DNN) framework to further improve the separation and recognition performance of the recently developed technique for general DNN-based speech separation. We adopt a divide and conquer strategy to design the proposed SND-DNNs with higher resolutions that a single general DNN could not well accommodate for all the speaker mixing variabilities at different levels of signal-to-noise ratios (SNRs). In this study two kinds of SNR-dependent DNNs, namely positive and negative DNNs, are trained to cover the mixed speech signals with positive and negative SNR levels, respectively. At the separation stage, a first-pass separation using a general DNN can give an accurate SNR estimation for a model selection. Experimental results on the Speech Separation Challenge (SSC) task show that SND-DNNs could yield significant performance improvements for both speech separation and recognition over a general DNN. Furthermore, this purely front-end processing method achieves a relative word error rate reduction of 11.6% over a state-of-the-art recognition system where a complicated joint decoding framework needs to be implemented in the back-end.
Keywords :
divide and conquer methods; neural nets; speech recognition; divide and conquer strategy; first-pass separation; general DNN-based speech separation; mixed speech signals; model selection; relative word error rate reduction; robust speech recognition; signal-noise-dependent deep neural networks; speaker mixing variabilities; Estimation; Hidden Markov models; Signal to noise ratio; Speech; Speech processing; Speech recognition; Training; deep neural networks; robust speech recognition; semi-supervised mode; single-channel speech separation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7177932