Title :
Progress in Broadcast News transcription at Dragon Systems
Author :
Wegmann, Steven ; Zhan, Puming ; Gillick, Larry
Author_Institution :
Dragon Syst. Inc., Newton, MA, USA
Abstract :
We report on progress in acoustic modelling and preprocessing in our Broadcast News transcription system. We have gone back to basics in acoustic modelling, and re-examined some of our standard practices, in particular the use of IMELDA and frequency warping, in the context of the Broadcast News corpus. We also report on some preliminary experiments with a generalization of IMELDA, “semi-tied covariances”. In combination, these improvements lead to a 3.5% absolute improvement over our eval97 models. We also describe our attempts to fix our rather primitive, silence-based preprocessing system, including initial results using a new speaker-change detection algorithm based on Hotelling´s T2-test
Keywords :
speech recognition; Broadcast News transcription; Dragon Systems; Hotelling´s T2-test; IMELDA; acoustic modelling; frequency warping; preprocessing; semi-tied covariance; silence-based preprocessing system; speaker-change detection algorithm; Cepstral analysis; Context modeling; Degradation; Detection algorithms; Frequency; Gaussian processes; Radio broadcasting; Speech; TV broadcasting; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758055