Title :
Evaluation of the affective valence of speech using pitch substructure
Author :
Cook, Norman D. ; Fujisawa, Takashi X. ; Takami, Kazuaki
Author_Institution :
Dept. of Informatics, Kansai Univ., Osaka, Japan
Abstract :
In order to study the relationship between emotion and intonation, a new technique is introduced for the extraction of the dominant pitches within speech utterances and the quasi-musical analysis of the multipitch structure. After the distribution of fundamental frequencies over the entire utterance has been obtained, the underlying pitch structure is determined using an unsupervised "cluster" (Gaussian mixtures) algorithm. The technique normally results in 3-6 pitch clusters per utterance that can then be evaluated in terms of their inherent dissonance, harmonic "tension", and "major or minor modality". Stronger dissonance and tension were found in utterances with negative affect, relative to utterances with positive affect. Most importantly, utterances that were evaluated as having positive or negative affect had significantly different modality values. Factor analysis showed that the measures involving multiple pitches were distinct from other acoustical measures, indicating that the pitch substructure is an independent factor contributing to the affective valence of speech prosody.
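Illustrative sketch (not the authors' implementation): the clustering step described in the abstract can be approximated by fitting a one-dimensional Gaussian mixture to an utterance's fundamental-frequency values with plain expectation-maximization. Several points here are assumptions and are not stated in the abstract: the F0 values are taken as already extracted, they are converted to semitones relative to a 55 Hz reference, the number of clusters `k` is supplied by the caller rather than selected automatically, and the helper names `hz_to_semitones` and `fit_gmm_1d` are hypothetical.

```python
import math


def hz_to_semitones(f0_hz, ref_hz=55.0):
    """Convert a frequency in Hz to semitones above ref_hz (assumed reference)."""
    return 12.0 * math.log2(f0_hz / ref_hz)


def fit_gmm_1d(values, k, iters=200, min_var=1e-3):
    """Fit a k-component 1-D Gaussian mixture to `values` with plain EM.

    Returns a list of (weight, mean, variance) tuples sorted by mean.
    The quantile initialisation and the variance floor are assumptions
    made for this sketch, not details taken from the paper.
    """
    n = len(values)
    ordered = sorted(values)
    # Start each mean at an evenly spaced quantile of the data.
    means = [ordered[int((i + 0.5) * n / k)] for i in range(k)]
    grand_mean = sum(values) / n
    grand_var = sum((v - grand_mean) ** 2 for v in values) / n
    variances = [max(grand_var / k, min_var)] * k
    weights = [1.0 / k] * k

    for _ in range(iters):
        # E-step: responsibility of each component for each value.
        resp = []
        for v in values:
            dens = [
                w * math.exp(-((v - m) ** 2) / (2.0 * s)) / math.sqrt(2.0 * math.pi * s)
                for w, m, s in zip(weights, means, variances)
            ]
            total = sum(dens) or 1e-300  # guard against underflow to zero
            resp.append([d / total for d in dens])

        # M-step: re-estimate weight, mean, and variance of each component.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-12:
                continue  # leave an empty component unchanged
            weights[j] = nj / n
            means[j] = sum(r[j] * v for r, v in zip(resp, values)) / nj
            var_j = sum(r[j] * (v - means[j]) ** 2 for r, v in zip(resp, values)) / nj
            variances[j] = max(var_j, min_var)

    return sorted(zip(weights, means, variances), key=lambda c: c[1])
```

Under these assumptions, calling `fit_gmm_1d(semitone_values, k)` for a `k` between 3 and 6 yields cluster means that could serve as the "dominant pitches" of an utterance; the dissonance, tension, and modality measures mentioned in the abstract are not sketched here because the abstract does not define them.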
Keywords :
feature extraction; speech processing; statistical analysis; affective speech valence; dissonance; dominant pitch extraction; emotion; factor analysis; harmonic tension; intonation; modality values; multipitch structure; quasimusical analysis; speech prosody; speech utterances; unsupervised cluster algorithm; Acoustic measurements; Acoustic signal detection; Clustering algorithms; Frequency; Higher order statistics; Humans; Informatics; Natural languages; Speech analysis; Gaussian clusters; fundamental frequency; harmony perception; prosody; speech
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TSA.2005.854115