DocumentCode :
312048
Title :
Transcribing radio news
Author :
Kubala, Francis ; Anastasakos, Tasos ; Jin, Hubert ; Nguyen, Long ; Schwartz, Richard
Author_Institution :
BBN Syst. & Technol. Corp., Cambridge, MA, USA
Volume :
2
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
598
Abstract :
We have recently extended the capabilities of BBN's large-vocabulary discrete-utterance speech recognition system (BYBLOS) to operate on raw audio recordings of radio news programming. The recordings are given to the system as large monolithic waveforms without any additional side-information. Our goal is to transcribe all speech in the input with the highest accuracy possible. The problem is very challenging because radio news programming has frequent changes in speaker, speaking style, dialect, accent, topic, channel and environmental conditions. Furthermore, the monolithic input presents new problems for recognition algorithms and language models since all useful boundaries (such as speaker turns or sentence ends) are unknown.
Keywords :
audio recording; radio broadcasting; speech recognition; BBN Systems and Technologies; BYBLOS; accent; accuracy; channel; dialect; environmental conditions; language models; large monolithic waveforms; large-vocabulary discrete-utterance speech recognition system; monolithic input; radio news programming; radio news transcription; raw audio recordings; sentence ends; speaker turns; speaking style; topic; unknown boundaries; Audio recording; Background noise; Bandwidth; Hidden Markov models; Natural languages; Speech recognition; System testing; Telephony; Training data; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607432
Filename :
607432