Title :
Semi-tied covariance matrices for acoustic models based on random forests of phonetic decision trees
Author :
Xue, Jian ; Che, Lili ; Zhao, Yunxin
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY
Abstract :
In this paper, we investigate combining semi-tied covariance matrices and random forests (RFs) based phonetic decision trees (PDTs) for acoustic modeling in conversational speech recognition. We first use the RF method to train multiple PDTs for each phone state unit, and generate multiple sets of acoustic models accordingly. We then apply semi-tied covariance matrices to each set of acoustic models to improve their fit to data. In decoding search we combine the likelihood scores from the multiple acoustic models for each speech frame. The viability of semi-tied covariance matrices with different tying classes are studied from their effects on the diversity of RF-based acoustic models as well as on the word accuracy of our task of telehealth automatic captioning. Experimental results indicate that semi-tied covariance matrices help enhance the diversity of the RFs-PDTs based acoustic models as well as increase word accuracy.
Keywords :
covariance matrices; decision trees; speech recognition; RF method; acoustic models; conversational speech recognition; multiple PDT; phonetic decision trees; random forests; semitied covariance matrices; telehealth automatic captioning; Computer science; Covariance matrix; Decision trees; Decoding; Diversity reception; Hidden Markov models; Radio frequency; Robustness; Speech recognition; Strontium; Random Forests; acoustic modeling; phonetic decision trees; semi-tied covariance matrices;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960622