DocumentCode :
178713
Title :
Stochastic pooling maxout networks for low-resource speech recognition
Author :
Meng Cai ; Yongzhe Shi ; Jia Liu
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
3266
Lastpage :
3270
Abstract :
Maxout network is a powerful alternate to traditional sigmoid neural networks and is showing success in speech recognition. However, maxout network is prone to overfitting thus regularization methods such as dropout are often needed. In this paper, a stochastic pooling regularization method for max-out networks is proposed to control overfitting. In stochastic pooling, a distribution is produced for each pooling region by the softmax normalization of the piece values. The active piece is selected based on the distribution during training, and an effective probability weighting is conducted during testing. We apply the stochastic pooling maxout (SPM) networks within the DNN-HMM framework and evaluate its effectiveness under a low-resource speech recognition condition. On benchmark test sets, the SPM network yields 4.7-8.6% relative improvements over the baseline maxout network. Further evaluations show the superiority of stochastic pooling over dropout for low-resource speech recognition.
Keywords :
speech recognition; stochastic processes; DNN-HMM framework; SPM network; benchmark testing; low-resource speech recognition condition; softmax normalization; stochastic pooling maxout networks; stochastic pooling regularization method; Feature extraction; Neural networks; Speech; Speech recognition; Stochastic processes; Training; Training data; deep learning; low-resource; maxout network; speech recognition; stochastic pooling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854204
Filename :
6854204
Link To Document :
بازگشت