Title :
Classifier-based learning of nonlinear feature manifold for visualization of emotional speech prosody
Author :
Vayrynen, E. ; Kortelainen, Jukka ; Seppanen, T.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Oulu, Oulu, Finland
Abstract :
Visualization of emotional speech data is an important tool for speech researchers who seek means to gain a deeper insight into the structure of complex multidimensional data. A visualization method is presented that utilizes feature selection and classifier optimization for learning Isomap manifolds of emotional speech data. The resulting manifold is based on those features that best discriminate between given emotional classes in the target space of specified embedding dimension. A nonlinear mapping function based on generalized regression neural networks (GRNNs) provides generalization for new data. A low-dimensional manifold of emotional speech data consisting of neutral, sad, angry, and happy expressions was constructed using prosodic and acoustic features of speech. Experimental results indicate that a 3D embedding provides the best classification performance. The manifold structure can be readily visualized and matches the circumplex and conical shapes predicted by dimensional models of emotion. Listening tests show excellent correlation between the organization of the data on the manifold and the listeners´ judgment of emotional intensity.
Keywords :
data visualisation; generalisation (artificial intelligence); learning (artificial intelligence); neural nets; pattern classification; speech processing; GRNN; Isomap manifold learning; acoustic speech feature; circumplex shape; classification performance; classifier optimization; classifier-based learning; conical shape; data generalization; embedding dimension; emotional class; emotional intensity; emotional speech data visualization; emotional speech prosody visualization; feature selection; generalized regression neural network; listening test; nonlinear feature manifold; nonlinear mapping function; prosodic speech feature; Biological system modeling; Data visualization; Estimation; Machine learning; Manifolds; Principal component analysis; Speech processing; Affect sensing and analysis; Biological system modeling; Data visualization; Estimation; Machine learning; Manifolds; Principal component analysis; Speech processing; feature evaluation and selection; information visualization; machine learning; modeling human emotion; nonlinear manifold learning; speech analysis;
Journal_Title :
Affective Computing, IEEE Transactions on
DOI :
10.1109/T-AFFC.2012.35