• DocumentCode
    1783874
  • Title

    Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System

  • Author

    Akagi, Masato ; Xiao Han ; Elbarougy, Reda ; Hamada, Yasuhiro ; Junfeng Li

  • Author_Institution
    Sch. of Inf. Sci., Japan Adv. Inst. of Sci. & Technol., Nomi, Japan
  • fYear
    2014
  • fDate
    27-29 Aug. 2014
  • Firstpage
    574
  • Lastpage
    577
  • Abstract
    Speech-to-speech translation (S2ST) is the process by which a spoken utterance in one language is used to produce a spoken output in another language. The conventional approach to S2ST has focused on processing linguistic information only by directly translating the spoken utterance from the source language to the target language without taking into account paralinguistic and non-linguistic information such as the emotional states at play in the source language. In this work, we explore how to deal with Para-and non-linguistic information among multiple languages, with a particular focus on speakers\´ emotional states, in S2ST scenarios called "affective S2ST." In our efforts to construct an effective system, we discuss (1) how to describe emotions in speech and how to model the perception/production of emotions and (2) the commonality and differences among multiple languages in the proposed model. We then use these discussions as context for (3) an examination of our "affective S2ST" system in operation.
  • Keywords
    emotion recognition; language translation; natural language processing; speech recognition; speech synthesis; affective S2ST system; affective speech-to-speech translation system; emotion perception-production; emotional speech recognition; emotional speech synthesis; nonlinguistic information; para-linguistic information; Acoustics; Databases; Emotion recognition; Production; Semantics; Speech; Speech recognition; Speech-to-speech translation (S2ST) system; emotion recognition/synthesis; multiple languages; paralinguistic and non-linguistic information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 2014 Tenth International Conference on
  • Conference_Location
    Kitakyushu
  • Print_ISBN
    978-1-4799-5389-9
  • Type

    conf

  • DOI
    10.1109/IIH-MSP.2014.148
  • Filename
    6998394