The use of semantic and acoustic features for open-domain TED talk summarization

Author

Koto, Fajri ; Sakti, Sakriani ; Neubig, Graham ; Toda, Tomoki ; Adriani, Mima ; Nakamura, Satoshi

Author_Institution

Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan

fYear

2014

fDate

9-12 Dec. 2014

Firstpage

1

Lastpage

4

Abstract

In this paper, we address the problem of automatic speech summarization on open-domain TED talks. The large vocabulary and diversity of topics from speaker-to-speaker presents significant difficulties. The challenges increase not only how to handle disfluencies and fillers, but also how to extract topic-related meaningful messages within the free talks. Here, we propose to incorporate semantic and acoustic features within the speech summarization technique. In addition, we also propose a new evaluation method for speech summarization by checking semantic similarity between system and human summarization. Experiments results reveal that the proposed methods are effective in spontaneous speech summarization.

Keywords

acoustics; speech processing; speech recognition; acoustic features; automatic speech summarization; disfluency handling; filler handling; free talks; open-domain TED talk summarization; semantic features; semantic similarity checking; spontaneous speech summarization; topic-related meaningful message extraction; Accuracy; Acoustics; Computational linguistics; Feature extraction; Semantics; Speech; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)

Conference_Location

Siem Reap

Type

conf

DOI

10.1109/APSIPA.2014.7041625

Filename

7041625