DocumentCode :
865875
Title :
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition
Author :
Seltzer, Michael L. ; Acero, Alex
Author_Institution :
Microsoft Res., Redmond, WA
Volume :
15
Issue :
1
fYear :
2007
Firstpage :
235
Lastpage :
245
Abstract :
One serious difficulty in the deployment of wideband speech recognition systems for new tasks is the expense in both time and cost of obtaining sufficient training data. A more economical approach is to collect telephone speech and then restrict the application to operate at the telephone bandwidth. However, this generally results in suboptimal performance compared to a wideband recognition system. In this paper, we propose a novel expectation-maximization (EM) algorithm in which wideband acoustic models are trained using a small amount of wideband speech and a larger amount of narrowband speech. We show how this algorithm can be incorporated into the existing training schemes of hidden Markov model (HMM) speech recognizers. Experiments performed using wideband speech and telephone speech demonstrate that the proposed mixed-bandwidth training algorithm results in significant improvements in recognition accuracy over conventional training strategies when the amount of wideband data is limited
Keywords :
bandwidth allocation; expectation-maximisation algorithm; hidden Markov models; speech recognition; telephony; HMM; expectation-maximization algorithm; hidden Markov model; mixed-bandwidth training algorithm; mixed-bandwidth training data; narrowband speech; telephone bandwidth; telephone speech; training wideband acoustic models; wideband speech recognition systems; Automatic speech recognition; Bandwidth; Costs; Hidden Markov models; Narrowband; Speech processing; Speech recognition; Telephony; Training data; Wideband; Acoustic modeling; bandwidth extension; hidden Markov models (HMMs); speech recognition; telephone speech;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2006.876774
Filename :
4032793
Link To Document :
بازگشت