DocumentCode
3528534
Title
Emotional speech recognition based on style estimation and adaptation with multiple-regression HMM
Author
Ijima, Yusuke ; Tachibana, Makoto ; Nose, Takashi ; Kobayashi, Takao
Author_Institution
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama
fYear
2009
fDate
19-24 April 2009
Firstpage
4157
Lastpage
4160
Abstract
This paper proposes a technique for emotional speech recognition which enables us to extract paralinguistic information as well as linguistic information contained in speech signal. The technique is based on style estimation and style adaptation using multiple-regression HMM. Recognition process consists of two stages. In the first stage, a style vector that represents the emotional expression category and intensity of its variation of input speech is estimated on a sentence-by-sentence basis. Then the acoustic models are adapted using the estimated style vector and standard HMM-based speech recognition is performed in the second stage. We assess the performance of the proposed technique on the recognition of acted emotional speech uttered by both professional narrators and non-professional speakers and show the effectiveness of the technique.
Keywords
emotion recognition; hidden Markov models; regression analysis; speech recognition; emotional speech recognition; linguistic information; multiple-regression HMM; paralinguistic information extract; recognition process; speech signal; style adaptation; style estimation; Adaptation model; Data mining; Emotion recognition; Hidden Markov models; Loudspeakers; Nose; Probability density function; Speech processing; Speech recognition; multiple-regression HMM (MRHMM); speaker adaptation; style adaptation; style estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960544
Filename
4960544
Link To Document