Title :
Deep recurrent regularization neural network for speech recognition
Author :
Jen-Tzung Chien ; Tsai-Wei Lu
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Abstract :
This paper presents a deep recurrent regularization neural network (DRRNN) for speech recognition. Our idea is to build a regularization neural network acoustic model by conducting the hybrid Tikhonov and weight-decay regularization which compensates the variations due to the input speech as well as the model parameters in the restricted Boltzmann machine as a pre-training stage for feature learning and structural modeling. In addition, a new backpropagation through time (BPTT) algorithm is developed by extending the truncated minibatch training for recurrent neural network where the minibatch BPTT is not only performed in recurrent layer but also in feedforward layer. The DRRNN acoustic model is accordingly established to capture the temporal correlation in a regularization neural network. Experimental results on the tasks of RM and Aurora4 show the effectiveness and robustness of using DRRNN for speech recognition.
Keywords :
Boltzmann machines; speech recognition; BPTT algorithm; DRRNN acoustic model; backpropagation through time algorithm; deep recurrent regularization neural network acoustic model; hybrid Tikhonov and weight-decay regularization; restricted Boltzmann machine; speech recognition; truncated minibatch training; Acoustics; Hidden Markov models; Neurons; Recurrent neural networks; Speech; Training; Recurrent neural network; acoustic model; deep learning; model regularization;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178834