Title :
F0 range and peak alignment across speakers and emotions
Author :
Morley, Eric ; Van Santen, Jan ; Klabbers, Esther ; Kain, Alexander
Author_Institution :
Center for Spoken Language Understanding, Oregon Health & Sci. Univ., Portland, OR, USA
Abstract :
We present an analysis of F0 range and peak alignment in emotional speech from a heterogeneous group of speakers varying in age and gender. Both speaker and emotion had a strong effect on F0 range. Despite these large changes in the F0 trajectory, peak alignment was remarkably stable. Using the Linear Alignment Model (LAM), we show that the effects on alignment of emotion and speaker differences, al though statistically significant, are small. This stability results in a conclusion that peak alignment, unlike F0 range, does not appear to carry much information about speaker identity or emotional state. The LAM is effective in that it explains 42% of the variance in peak location on average, and furthermore it predicts the time of F0 peaks with an average RMS error of 12ms.
Keywords :
speaker recognition; F0 range; LAM; RMS error; emotional speech; linear alignment model; peak alignment across speakers; Analysis of variance; Correlation; Foot; Linear regression; Robustness; Speech; Trajectory; emotion recognition; human voice; speech analysis; speech synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947467