DocumentCode
118112
Title
The use of semantic and acoustic features for open-domain TED talk summarization
Author
Koto, Fajri ; Sakti, Sakriani ; Neubig, Graham ; Toda, Tomoki ; Adriani, Mima ; Nakamura, Satoshi
Author_Institution
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
1
Lastpage
4
Abstract
In this paper, we address the problem of automatic speech summarization on open-domain TED talks. The large vocabulary and diversity of topics from speaker-to-speaker presents significant difficulties. The challenges increase not only how to handle disfluencies and fillers, but also how to extract topic-related meaningful messages within the free talks. Here, we propose to incorporate semantic and acoustic features within the speech summarization technique. In addition, we also propose a new evaluation method for speech summarization by checking semantic similarity between system and human summarization. Experiments results reveal that the proposed methods are effective in spontaneous speech summarization.
Keywords
acoustics; speech processing; speech recognition; acoustic features; automatic speech summarization; disfluency handling; filler handling; free talks; open-domain TED talk summarization; semantic features; semantic similarity checking; spontaneous speech summarization; topic-related meaningful message extraction; Accuracy; Acoustics; Computational linguistics; Feature extraction; Semantics; Speech; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
Conference_Location
Siem Reap
Type
conf
DOI
10.1109/APSIPA.2014.7041625
Filename
7041625
Link To Document