DocumentCode :
336729
Title :
Improvements in recognition of conversational telephone speech
Author :
Peskin, Barbara ; Newman, Michael ; McAllaster, Don ; Nagesha, Venkatesh ; Richards, Hywel ; Wegmann, Steven ; Hunt, Melvyn ; Gillick, Larry
Author_Institution :
Dragon Syst. Inc., Newton, MA, USA
Volume :
1
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
53
Abstract :
This paper describes recent changes in Dragon's speech recognition system which have markedly improved performance on conversational telephone speech. Key changes include: the conversion to modified perceptual linear prediction (PLP)-based cepstra from mel-cepstra; the replacement of our usual IMELDA transformation by a new transform using “semi-tied covariance”; a new multi-pass adaptation protocol; probabilities on alternate pronunciations in the lexicon; the addition of word-boundary tags in our acoustic models; and the redistribution of model parameters to build fewer output distributions but with more mixture components per model.
Keywords :
cepstral analysis; prediction theory; probability; speech recognition; telephony; Dragon speech recognition system; acoustic models; alternate pronunciations probability; conversational telephone speech recognition; lexicon; mixture components; model parameters redistribution; modified PLP-based cepstra; modified perceptual linear prediction; multi-pass adaptation protocol; output distributions; performance improvement; semi-tied covariance; transform; word-boundary tags; Acoustic testing; Broadcasting; Error analysis; Natural languages; Protocols; Signal processing; Speech recognition; Standards development; Switches; Telephony;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.758060
Filename :
758060