Title :
Building an ensemble of CD-DNN-HMM acoustic model using random forests of phonetic decision trees
Author :
Tuo Zhao ; Yunxin Zhao ; Xin Chen
Author_Institution :
Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA
Abstract :
We propose an RF-PDT+CD-DNN approach to generate an ensemble of context-dependent pre-trained deep neural networks (CD-DNNs) using random forests of phonetic decision trees (RF-PDTs) and constructing a CD-DNN-HMM-based ensemble acoustic model (EAM). We present evaluation results on the TIMIT dataset and a telemedicine automatic captioning dataset and demonstrate that the proposed RF-PDT+CD-DNN based EAM significantly outperforms the CD-DNN based single acoustic model (SAM) in phone and word recognition accuracies.
Keywords :
decision trees; neural nets; speech recognition; telemedicine; CD-DNN-HMM acoustic model ensemble; EAM; RF-PDT+CD-DNN; SAM; TIMIT dataset; context-dependent pretrained deep neural networks; phone recognition accuracies; phonetic decision trees; random forests; random forests of phonetic decision trees; single acoustic model; telemedicine automatic captioning dataset; word recognition accuracies; deep neural network; discriminative pre-training; ensemble acoustic model; phonetic decision tree; random forest; single acoustic model;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936680