Title :
A model structure integration based on a Bayesian framework for speech recognition
Author :
Shiota, Sayaka ; Hashimoto, Kei ; Nankaku, Yoshihiko ; Tokuda, Keiichi
Author_Institution :
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan
Abstract :
This paper proposes an acoustic modeling technique based on Bayesian framework using multiple model structures for speech recognition. The Bayesian approach is a statistical technique for estimating reliable predictive distributions by marginalizing model parameters, and its effectiveness in HMM-based speech recognition has been reported. Although the basic idea underlying the Bayesian approach is to treat all parameters as random variables, only one model structure is still selected in the conventional method. Multiple model structures are treated as latent variables in the proposed method and integrated based on the Bayesian framework. Furthermore, we applied deterministic annealing to the training algorithm to estimate appropriate acoustic models. The proposed method effectively utilizes multiple model structures, especially in the early stage of training and this leads to better predictive distributions and improvement of recognition performance.
Keywords :
Bayes methods; hidden Markov models; speech recognition; Bayesian framework; HMM-based speech recognition; acoustic modeling technique; deterministic annealing; hidden Markov model; latent variables; marginalizing model parameters; multiple model structures; predictive distributions; random variables; statistical technique; training algorithm; Annealing; Bayesian methods; Hidden Markov models; Periodic structures; Predictive models; Speech recognition; Training; Bayesian methods; Deterministic annealing; Hidden Markov model; Speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288996