DocumentCode :
3429080
Title :
Maximum likelihood nonlinear transformations based on deep neural networks
Author :
Xiaodong Cui ; Goel, Vaibhava
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4320
Lastpage :
4324
Abstract :
This paper investigates modeling nonlinear transformations based on deep neural networks (DNNs). Specifically, a DNN is used as a nonlinear mapping function for feature space transformation for HMM acoustic models. The nonlinear transformations are estimated under the sequence-based maximum likelihood criterion. The likelihood partition function is evaluated using the Monte Carlo method based on importance sampling. The DNN is first pre-trained approximately to a linear transformation then followed by fine-tuning using the gradient descent algorithm. In addition, a deep stacked architecture is proposed that builds a DNN as a series of sub-networks hierarchically with each representing a nonlinear transformation. A block-wise learning strategy is introduced. LVCSR speaker adaptation experiments on the proposed maximum likelihood nonlinear transformation have shown superior results than the widely-used CMLLR transformation.
Keywords :
Monte Carlo methods; gradient methods; hidden Markov models; neural nets; speech recognition; CMLLR transformation; DNN; HMM acoustic models; LVCSR speaker adaptation; Monte Carlo method; blockwise learning strategy; deep neural networks; deep stacked architecture; feature space transformation; gradient descent algorithm; linear transformation; maximum likelihood nonlinear transformations; nonlinear mapping function; sequence based maximum likelihood criterion; Artificial neural networks; Hidden Markov models; Speech; Monte Carlo method; deep neural networks; importance sampling; maximum likelihood; nonlinear transformation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178786
Filename :
7178786
Link To Document :
بازگشت