مرکز منطقه ای اطلاع رساني علوم و فناوري - A model structure integration based on a Bayesian framework for speech recognition

DocumentCode :

3165686

Title :

A model structure integration based on a Bayesian framework for speech recognition

Author :

Shiota, Sayaka ; Hashimoto, Kei ; Nankaku, Yoshihiko ; Tokuda, Keiichi

Author_Institution :

Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4813

Lastpage :

4816

Abstract :

This paper proposes an acoustic modeling technique based on Bayesian framework using multiple model structures for speech recognition. The Bayesian approach is a statistical technique for estimating reliable predictive distributions by marginalizing model parameters, and its effectiveness in HMM-based speech recognition has been reported. Although the basic idea underlying the Bayesian approach is to treat all parameters as random variables, only one model structure is still selected in the conventional method. Multiple model structures are treated as latent variables in the proposed method and integrated based on the Bayesian framework. Furthermore, we applied deterministic annealing to the training algorithm to estimate appropriate acoustic models. The proposed method effectively utilizes multiple model structures, especially in the early stage of training and this leads to better predictive distributions and improvement of recognition performance.

Keywords :

Bayes methods; hidden Markov models; speech recognition; Bayesian framework; HMM-based speech recognition; acoustic modeling technique; deterministic annealing; hidden Markov model; latent variables; marginalizing model parameters; multiple model structures; predictive distributions; random variables; statistical technique; training algorithm; Annealing; Bayesian methods; Hidden Markov models; Periodic structures; Predictive models; Speech recognition; Training; Bayesian methods; Deterministic annealing; Hidden Markov model; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288996

Filename :

6288996

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3165686