Title :
Multi-scale modulation filtering in automatic detection of emotions in telephone speech
Author :
Pohjalainen, Jouni ; Alku, Paavo
Author_Institution :
Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
Abstract :
This study investigates emotion detection from noise-corrupted telephone speech. A generic modulation filtering approach for audio pattern recognition is proposed that utilizes inherent long-term properties of acoustic features in different classes. When applied to binary classification along the activation and valence dimensions, filtering the baseline short-time timbral features in both the training and detection phase leads to significant improvement especially in noise robustness. Automatic selection of training data based on the filter´s prediction residual further improves the results.
Keywords :
emotion recognition; filtering theory; signal classification; speech recognition; acoustic features; activation dimensions; audio pattern recognition; automatic emotion detection; baseline short-time timbral features; binary classification; generic modulation filtering; multiscale modulation filtering; noise robustness; noise-corrupted telephone speech; valence dimensions; Acoustics; Feature extraction; Modulation; Noise; Speech; Speech processing; Speech recognition; computational paralinguistics; emotion detection; speech analysis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853743