DocumentCode :
134230
Title :
Labeling unsegmented sequence data with DNN-HMM and its application for speech recognition
Author :
Xiangang Li ; Xihong Wu
Author_Institution :
Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
10
Lastpage :
14
Abstract :
Recently, deep neural network (DNN) with hidden Markov model (HMM) has turned out to be a superior sequence learning framework, based on which significant improvements were achieved in many application tasks, such as automatic speech recognition (ASR). However, the training of DNN-HMM requires the pre-segmented training data, which can be generated using Gaussian Mixture Model (GMM) in ASR tasks. Thus, questions are raised by many researchers: can we train the DNN-HMM without GMM seeding, and what does it suggest if the answer is yes? In this research, we come up with the `yes´ answer by presenting forward-backward learning algorithm for DNN-HMM framework. Besides, a training procedure is proposed, in which, the training for context independent (CI) DNN-HMM is treated as the pre-training for context dependent (CD) DNN-HMM. To evaluate the contribution of this work, experiments on ASR task with the benchmark corpus TIMIT are performed, and the results demonstrate the effectiveness of this research.
Keywords :
hidden Markov models; learning (artificial intelligence); neural nets; speech recognition; ASR task; automatic speech recognition; benchmark corpus TIMIT; context dependent DNN-HMM; context independent DNN-HMM; deep neural network; forward-backward learning algorithm; hidden Markov model; sequence learning framework; training procedure; unsegmented sequence data labeling; Acoustics; Hidden Markov models; Neural networks; Speech; Speech recognition; Training; Viterbi algorithm; DNN-HMM; forward-backward algorithm; speech recognition; unsegmented sequence data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936622
Filename :
6936622
Link To Document :
بازگشت