DocumentCode :
661406
Title :
Context-dependent deep neural networks for commercial Mandarin speech recognition applications
Author :
Jianwei Niu ; Lei Xie ; Lei Jia ; Na Hu
Author_Institution :
Shaanxi Provincial Key Lab. of Speech & Image Inf. Process. Sch. of Comput. Sci., Northwestern Polytech. Univ., Xi´an, China
fYear :
2013
fDate :
Oct. 29 2013-Nov. 1 2013
Firstpage :
1
Lastpage :
5
Abstract :
Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have been successfully used in some commercial large-vocabulary English speech recognition systems. It has been proved that CD-DNN-HMMs significantly outperform the conventional context-dependent Gaussian mixture model (GMM)-HMMs (CD-GMM-HMMs). In this paper, we report our latest progress on CD-DNN-HMMs for commercial Mandarin speech recognition applications in Baidu. Experiments demonstrate that CD-DNN-HMMs can get relative 26% word error reduction and relative 16% sentence error reduction in Baidu´s short message (SMS) voice input and voice search applications, respectively, compared with state-of-the-art CD-GMM-HMMs trained using fMPE. To the best of our knowledge, this is the first time the performances of CD-DNN-HMMs are reported for commercial Mandarin speech recognition applications. We also propose a GPU on-chip speed-up training approach which can achieve a speed-up ratio of nearly two for DNN training.
Keywords :
graphics processing units; hidden Markov models; neural nets; speech recognition; Baidu short message; CD-DNN-HMM; DNN training; GPU on-chip speed-up training approach; commercial Mandarin speech recognition applications; commercial large-vocabulary English speech recognition systems; context-dependent Gaussian mixture model-HMM; context-dependent deep neural network hidden Markov models; fMPE; sentence error reduction; speed-up ratio; voice input; voice search; Accuracy; Graphics processing units; Hidden Markov models; Speech; Speech recognition; Training; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
Type :
conf
DOI :
10.1109/APSIPA.2013.6694268
Filename :
6694268
Link To Document :
بازگشت