DocumentCode :
3426598
Title :
Automatic lecture transcription by exploiting presentation slide information for language model adaptation
Author :
Kawahara, Tatsuya ; Nemoto, Yusuke ; Akita, Yuya
Author_Institution :
Acad. Center for Comput. & Media Studies Sakyo-ku, Kyoto Univ., Kyoto
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4929
Lastpage :
4932
Abstract :
The paper addresses language model adaptation for automatic lecture transcription by fully exploiting presentation slide information used in the lecture. As the text in the presentation slides is small in its size and fragmentary in its content, a robust adaptation scheme is addressed by focusing on the keyword and topic information. Several methods are investigated and combined; first, global topic adaptation is conducted based on PLSA (probabilistic latent semantic analysis) using keywords appearing in all slides. Web text is also retrieved to enhance the relevant text. Then, local preference of the keywords are reflected with a cache model by referring to the slide used during each utterance. Experimental evaluations on real lectures show that the proposed method combining the global and local slide information achieves a significant improvement of recognition accuracy, especially in the detection rate of content keywords.
Keywords :
speech recognition; Web text; automatic lecture transcription; automatic speech recognition; cache model; global adaptation; language model adaptation; local adaptation; presentation slide information; probabilistic latent semantic analysis; robust adaptation scheme; Adaptation model; Audio recording; Automatic speech recognition; Conducting materials; Deafness; Error analysis; Microphones; Natural languages; Robustness; Speech recognition; PLSA; cache model; language model; lectures; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518763
Filename :
4518763
Link To Document :
بازگشت