DocumentCode
2279783
Title
Automatic transcription of spontaneous lecture speech
Author
Kawahara, Tatsuya ; Nanjo, Hiroaki ; Furui, Sadaoki
Author_Institution
Sch. of Informatics, Kyoto Univ., Japan
fYear
2001
fDate
2001
Firstpage
186
Lastpage
189
Abstract
We introduce our extensive projects on spontaneous speech processing and current trials of lecture speech recognition. A large corpus of lecture presentations and talks is being collected in the project. We have trained initial baseline models and confirmed significant difference of real lectures and written notes. In spontaneous lecture speech, the speaking rate is generally faster and changes a lot, which makes it harder to apply fixed segmentation and decoding settings. Therefore, we propose sequential decoding and speaking-rate dependent decoding strategies. The sequential decoder simultaneously performs automatic segmentation and decoding of input utterances. Then, the most adequate acoustic analysis, phone models and decoding parameters are applied according to the current speaking rate. These strategies achieve improvement on automatic transcription of real lecture speech.
Keywords
acoustic signal processing; learning (artificial intelligence); natural language interfaces; speech processing; speech recognition; text analysis; acoustic analysis; automatic transcription; lecture speech recognition; segmentation; sequential decoding; speaking rate; spontaneous speech processing; Automatic speech recognition; Broadcasting; Buildings; Decoding; Informatics; Loudspeakers; Speech analysis; Speech processing; Telephony; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN
0-7803-7343-X
Type
conf
DOI
10.1109/ASRU.2001.1034618
Filename
1034618
Link To Document