Title :
Summarization of spoken lectures based on linguistic surface and prosodic information
Author :
Togashi, S. ; Yamaguchi, M. ; Nakagawa, S.
Author_Institution :
Dept. of Inf. & Comput. Sci., Toyohashi Univ. of Technol., Toyohashi
Abstract :
We aim to extract automatically the summarization of spoken lectures for conferences and classes. For this purpose, at first we compared results of summarization extracted by human subjects. We found large differences with every subject. Then we investigated the relations between linguistic surface information and human results, and we obtained useful linguistic surface information. Next, we summarized spoken lectures on conferences and classes using the linguistic information. Additionally, we also focused on prosodic features; F0 and power. We conducted the same experiments on them. Lastly, we combined linguistic surface information and prosodic information. As a result, the proposed automatic summarization produced a better F- measure (0.599), k-value (0.420) and Rouge metric (0.758) comparable with human results.
Keywords :
natural language processing; text analysis; linguistic surface information; prosodic information; spoken lectures; summarization; Audio recording; Automatic speech recognition; Data mining; Decoding; Dynamic programming; Humans; Loudspeakers; Natural language processing; Natural languages; Speech recognition;
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
DOI :
10.1109/SLT.2006.326810