DocumentCode :
1687694
Title :
Multi-task learning in deep neural networks for improved phoneme recognition
Author :
Seltzer, Michael L. ; Droppo, Jasha
Author_Institution :
Microsoft Res., Redmond, WA, USA
fYear :
2013
Firstpage :
6965
Lastpage :
6969
Abstract :
In this paper we demonstrate how to improve the performance of deep neural network (DNN) acoustic models using multi-task learning. In multi-task learning, the network is trained to perform both the primary classification task and one or more secondary tasks using a shared representation. The additional model parameters associated with the secondary tasks represent a very small increase in the number of trained parameters, and can be discarded at runtime. In this paper, we explore three natural choices for the secondary task: the phone label, the phone context, and the state context. We demonstrate that, even on a strong baseline, multi-task learning can provide a significant decrease in error rate. Using phone context, the phonetic error rate (PER) on TIMIT is reduced from 21.63% to 20.25% on the core test set, and surpassing the best performance in the literature for a DNN that uses a standard feed-forward network architecture.
Keywords :
acoustic signal processing; feedforward neural nets; learning (artificial intelligence); signal classification; signal representation; speech recognition; DNN acoustic models; PER; TIMIT; deep neural networks; feed-forward network architecture; multitask learning; network training; phone context; phone label; phoneme recognition; phonetic error rate; primary classification task; secondary tasks; shared representation; speech recognition; state context; Acoustics; Context; Hidden Markov models; Neural networks; Speech; Speech recognition; Training; Acoustic model; TIMIT; deep neural network; multi-task learning; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639012
Filename :
6639012
Link To Document :
بازگشت