مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speech recognition based on a Bayesian prediction approach

DocumentCode :

1522792

Title :

Robust speech recognition based on a Bayesian prediction approach

Author :

Jiang, Hui ; Hirose, Keikichi ; Huo, Qiang

Volume :

Issue :

fYear :

1999

fDate :

7/1/1999 12:00:00 AM

Firstpage :

426

Lastpage :

440

Abstract :

We study a category of robust speech recognition problem in which mismatches exist between training and testing conditions, and no accurate knowledge of the mismatch mechanism is available. The only available information is the test data along with a set of pretrained Gaussian mixture continuous density hidden Markov models (CDHMMs). We investigate the problem from the viewpoint of Bayesian prediction. A simple prior distribution, namely constrained uniform distribution, is adopted to characterize the uncertainty of the mean vectors of the CDHMMs. Two methods, namely a model compensation technique based on Bayesian predictive density and a robust decision strategy called Viterbi Bayesian predictive classification are studied. The proposed methods are compared with the conventional Viterbi decoding algorithm in speaker-independent recognition experiments on isolated digits and TI connected digit strings (TIDTGITS), where the mismatches between training and testing conditions are caused by: (1) additive Gaussian white noise, (2) each of 25 types of actual additive ambient noises, and (3) gender difference. The experimental results show that the adopted prior distribution and the proposed techniques help to improve the performance robustness under the examined mismatch conditions

Keywords :

AWGN; Bayes methods; Gaussian processes; hidden Markov models; prediction theory; signal classification; speech recognition; Bayesian prediction; Bayesian predictive density; CDHMM; TI connected digit strings; Viterbi Bayesian predictive classification; Viterbi decoding algorithm; additive Gaussian white noise; additive ambient noise; automatic speech recognition; constrained uniform distribution; continuous density hidden Markov models; experimental results; gender difference; isolated digits; mean vectors uncertainty; mismatch conditions; model compensation technique; pretrained Gaussian mixture; prior distribution; robust decision strategy; robust speech recognition; speaker-independent recognition experiments; test data; testing conditions; training conditions; Additive white noise; Bayesian methods; Decoding; Hidden Markov models; Noise robustness; Predictive models; Speech recognition; Testing; Uncertainty; Viterbi algorithm;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.771309

Filename :

771309

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1522792