• DocumentCode
    118112
  • Title

    The use of semantic and acoustic features for open-domain TED talk summarization

  • Author

    Koto, Fajri ; Sakti, Sakriani ; Neubig, Graham ; Toda, Tomoki ; Adriani, Mima ; Nakamura, Satoshi

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we address the problem of automatic speech summarization on open-domain TED talks. The large vocabulary and diversity of topics from speaker-to-speaker presents significant difficulties. The challenges increase not only how to handle disfluencies and fillers, but also how to extract topic-related meaningful messages within the free talks. Here, we propose to incorporate semantic and acoustic features within the speech summarization technique. In addition, we also propose a new evaluation method for speech summarization by checking semantic similarity between system and human summarization. Experiments results reveal that the proposed methods are effective in spontaneous speech summarization.
  • Keywords
    acoustics; speech processing; speech recognition; acoustic features; automatic speech summarization; disfluency handling; filler handling; free talks; open-domain TED talk summarization; semantic features; semantic similarity checking; spontaneous speech summarization; topic-related meaningful message extraction; Accuracy; Acoustics; Computational linguistics; Feature extraction; Semantics; Speech; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
  • Conference_Location
    Siem Reap
  • Type

    conf

  • DOI
    10.1109/APSIPA.2014.7041625
  • Filename
    7041625