DocumentCode :
542158
Title :
Automatic indexing of lecture speech by extracting topic-independent discourse markers
Author :
Kawahara, Tatsuya ; Hasegawa, Masahiro
Author_Institution :
School of Informatics, Kyoto University, Sakyo-ku, 606-8501, Japan
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
Automatic detection of section (sub-topic) boundaries in lecture speech is addressed. The method makes use of the characteristic expressions used in initial utterances of sections defined as discourse makers, as well as pause and language model information. The discourse markers are derived in a totally unsupervised manner based on word statistics used in the information retrieval technique. The statistics is used to select candidates picked up by other information. Experimental results show that the proposed method realizes better indexing performance (better precision at high recall rates) than the simple baseline method using pause information only. Moreover, it is shown to be robust against speech recognition errors.
Keywords :
Computational modeling; Machine assisted indexing; Manuals; Radio access networks; Soil; Speech; Switches;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743639
Filename :
5743639
Link To Document :
بازگشت