DocumentCode :
118112
Title :
The use of semantic and acoustic features for open-domain TED talk summarization
Author :
Koto, Fajri ; Sakti, Sakriani ; Neubig, Graham ; Toda, Tomoki ; Adriani, Mima ; Nakamura, Satoshi
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
fYear :
2014
fDate :
9-12 Dec. 2014
Firstpage :
1
Lastpage :
4
Abstract :
In this paper, we address the problem of automatic speech summarization on open-domain TED talks. The large vocabulary and diversity of topics from speaker-to-speaker presents significant difficulties. The challenges increase not only how to handle disfluencies and fillers, but also how to extract topic-related meaningful messages within the free talks. Here, we propose to incorporate semantic and acoustic features within the speech summarization technique. In addition, we also propose a new evaluation method for speech summarization by checking semantic similarity between system and human summarization. Experiments results reveal that the proposed methods are effective in spontaneous speech summarization.
Keywords :
acoustics; speech processing; speech recognition; acoustic features; automatic speech summarization; disfluency handling; filler handling; free talks; open-domain TED talk summarization; semantic features; semantic similarity checking; spontaneous speech summarization; topic-related meaningful message extraction; Accuracy; Acoustics; Computational linguistics; Feature extraction; Semantics; Speech; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
Conference_Location :
Siem Reap
Type :
conf
DOI :
10.1109/APSIPA.2014.7041625
Filename :
7041625
Link To Document :
بازگشت