Title :
Broadcast news transcription
Author :
Kubala, Francis ; Jin, Hubert ; Matsoukas, Spyros ; Nguyen, Long ; Schwartz, Richard
Author_Institution :
BBN Syst. & Technol. Corp., Cambridge, MA, USA
Abstract :
We describe our work on automatic transcription of radio and television news broadcasts. This problem is very challenging for large vocabulary speech recognition because of the frequent and unpredictable changes that occur in speaker, speaking style, topic, channel, and background conditions. Faced with such a problem, there is a strong tendency to try to carve the input into separable classes and deal with each one independently. In our early work on this problem, however, we are finding that the rewards for condition-specific techniques are disappointingly small. This is forcing us to look for general, robust, and adaptive algorithms for dealing with extremely variable data. We describe the BBN BYB-LOS recognition system configured to handle off-line transcription and we characterize the speech contained in the 1996 DARPA Hub-4 testbed. On the partitioned development test set, we achieved a 29.4% overall word error rate
Keywords :
acoustic signal processing; natural languages; speech recognition; speech recognition equipment; 1996 DARPA Hub-4 testbed; BBN BYB-LOS recognition system; automatic transcription; background conditions; broadcast news transcription; condition-specific techniques; large vocabulary speech recognition; off-line transcription; radio news; speaker; speaking style; television news; Adaptive algorithm; Decoding; Hidden Markov models; Radio broadcasting; Robustness; Signal processing algorithms; Speech recognition; TV broadcasting; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.599601