Title :
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition
Author :
Seltzer, Michael L. ; Acero, Alex
Author_Institution :
Microsoft Res., Redmond, WA
Abstract :
One serious difficulty in the deployment of wideband speech recognition systems for new tasks is the expense in both time and cost of obtaining sufficient training data. A more economical approach is to collect telephone speech and then restrict the application to operate at the telephone bandwidth. However, this generally results in suboptimal performance compared to a wideband recognition system. In this paper, we propose a novel expectation-maximization (EM) algorithm in which wideband acoustic models are trained using a small amount of wideband speech and a larger amount of narrowband speech. We show how this algorithm can be incorporated into the existing training schemes of hidden Markov model (HMM) speech recognizers. Experiments performed using wideband speech and telephone speech demonstrate that the proposed mixed-bandwidth training algorithm results in significant improvements in recognition accuracy over conventional training strategies when the amount of wideband data is limited
Keywords :
bandwidth allocation; expectation-maximisation algorithm; hidden Markov models; speech recognition; telephony; HMM; expectation-maximization algorithm; hidden Markov model; mixed-bandwidth training algorithm; mixed-bandwidth training data; narrowband speech; telephone bandwidth; telephone speech; training wideband acoustic models; wideband speech recognition systems; Automatic speech recognition; Bandwidth; Costs; Hidden Markov models; Narrowband; Speech processing; Speech recognition; Telephony; Training data; Wideband; Acoustic modeling; bandwidth extension; hidden Markov models (HMMs); speech recognition; telephone speech;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2006.876774