Title :
Automatic transcription of spontaneous lecture speech
Author :
Kawahara, Tatsuya ; Nanjo, Hiroaki ; Furui, Sadaoki
Author_Institution :
Sch. of Informatics, Kyoto Univ., Japan
Abstract :
We introduce our extensive projects on spontaneous speech processing and current trials of lecture speech recognition. A large corpus of lecture presentations and talks is being collected in the project. We have trained initial baseline models and confirmed significant difference of real lectures and written notes. In spontaneous lecture speech, the speaking rate is generally faster and changes a lot, which makes it harder to apply fixed segmentation and decoding settings. Therefore, we propose sequential decoding and speaking-rate dependent decoding strategies. The sequential decoder simultaneously performs automatic segmentation and decoding of input utterances. Then, the most adequate acoustic analysis, phone models and decoding parameters are applied according to the current speaking rate. These strategies achieve improvement on automatic transcription of real lecture speech.
Keywords :
acoustic signal processing; learning (artificial intelligence); natural language interfaces; speech processing; speech recognition; text analysis; acoustic analysis; automatic transcription; lecture speech recognition; segmentation; sequential decoding; speaking rate; spontaneous speech processing; Automatic speech recognition; Broadcasting; Buildings; Decoding; Informatics; Loudspeakers; Speech analysis; Speech processing; Telephony; Testing;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034618