DocumentCode
180084
Title
Exploring one pass learning for deep neural network training with averaged stochastic gradient descent
Author
Zhao You ; Xiaorui Wang ; Bo Xu
Author_Institution
Interactive Digital Media Technol. Res. Center, Inst. of Autom., Beijing, China
fYear
2014
fDate
4-9 May 2014
Firstpage
6854
Lastpage
6858
Abstract
Deep neural network acoustic models have shown large improvement in performance over Gaussian mixture models (G-MMs) in recent studies. Typically, deep neural networks are trained based on the cross-entropy criterion using stochastic gradient descent (SGD). However, plain SGD requires scanning the whole training set many passes before reaching the asymptotic region, making it difficult to scale to large dataset. It has been established that the second order SGD can potentially reach its asymptotic region in one pass through the training dataset. However, since it involves expensive computing for the inverse of Hessian matrix in the loss function, its application is limited. Averaged stochastic gradient descent (ASGD) is proved simple and effective for one pass online learning. This paper investigates the ASGD algorithm for deep neural network training. We tested ASGD on the Mandarin Chinese record speech recognition task using deep neural networks. Experimental results show that the performance of one pass ASGD is very close to that of multiple passes SGD.
Keywords
Gaussian processes; Hessian matrices; acoustic signal processing; entropy; mixture models; neural nets; speech recognition; telecommunication computing; ASGD; G-MM; Gaussian mixture models; Mandarin Chinese record speech recognition task; acoustic models; asymptotic region; averaged stochastic gradient descent; cross-entropy criterion; deep neural network training; expensive computing; inverse Hessian matrix; one pass learning; training dataset; Acoustics; Neural networks; Optimization; Schedules; Speech recognition; Stochastic processes; Training; averaged stochastic gradient descent; deep neural network; one pass learning; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854928
Filename
6854928
Link To Document