مرکز منطقه ای اطلاع رساني علوم و فناوري - Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition

DocumentCode :

865875

Title :

Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition

Author :

Seltzer, Michael L. ; Acero, Alex

Author_Institution :

Microsoft Res., Redmond, WA

Volume :

Issue :

fYear :

2007

Firstpage :

235

Lastpage :

245

Abstract :

One serious difficulty in the deployment of wideband speech recognition systems for new tasks is the expense in both time and cost of obtaining sufficient training data. A more economical approach is to collect telephone speech and then restrict the application to operate at the telephone bandwidth. However, this generally results in suboptimal performance compared to a wideband recognition system. In this paper, we propose a novel expectation-maximization (EM) algorithm in which wideband acoustic models are trained using a small amount of wideband speech and a larger amount of narrowband speech. We show how this algorithm can be incorporated into the existing training schemes of hidden Markov model (HMM) speech recognizers. Experiments performed using wideband speech and telephone speech demonstrate that the proposed mixed-bandwidth training algorithm results in significant improvements in recognition accuracy over conventional training strategies when the amount of wideband data is limited

Keywords :

bandwidth allocation; expectation-maximisation algorithm; hidden Markov models; speech recognition; telephony; HMM; expectation-maximization algorithm; hidden Markov model; mixed-bandwidth training algorithm; mixed-bandwidth training data; narrowband speech; telephone bandwidth; telephone speech; training wideband acoustic models; wideband speech recognition systems; Automatic speech recognition; Bandwidth; Costs; Hidden Markov models; Narrowband; Speech processing; Speech recognition; Telephony; Training data; Wideband; Acoustic modeling; bandwidth extension; hidden Markov models (HMMs); speech recognition; telephone speech;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.876774

Filename :

4032793

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=865875