• DocumentCode
    3528534
  • Title

    Emotional speech recognition based on style estimation and adaptation with multiple-regression HMM

  • Author

    Ijima, Yusuke ; Tachibana, Makoto ; Nose, Takashi ; Kobayashi, Takao

  • Author_Institution
    Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4157
  • Lastpage
    4160
  • Abstract
    This paper proposes a technique for emotional speech recognition which enables us to extract paralinguistic information as well as linguistic information contained in speech signal. The technique is based on style estimation and style adaptation using multiple-regression HMM. Recognition process consists of two stages. In the first stage, a style vector that represents the emotional expression category and intensity of its variation of input speech is estimated on a sentence-by-sentence basis. Then the acoustic models are adapted using the estimated style vector and standard HMM-based speech recognition is performed in the second stage. We assess the performance of the proposed technique on the recognition of acted emotional speech uttered by both professional narrators and non-professional speakers and show the effectiveness of the technique.
  • Keywords
    emotion recognition; hidden Markov models; regression analysis; speech recognition; emotional speech recognition; linguistic information; multiple-regression HMM; paralinguistic information extract; recognition process; speech signal; style adaptation; style estimation; Adaptation model; Data mining; Emotion recognition; Hidden Markov models; Loudspeakers; Nose; Probability density function; Speech processing; Speech recognition; multiple-regression HMM (MRHMM); speaker adaptation; style adaptation; style estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960544
  • Filename
    4960544