Title of article

A new approach of audio emotion recognition

Author/Authors

Ooi، نويسنده , , Chien Shing and Seng، نويسنده , , Kah Phooi and Ang، نويسنده , , Li-Minn and Chew، نويسنده , , Li Wern Chew، نويسنده ,

Issue Information

روزنامه با شماره پیاپی سال 2014

Pages

12

From page

5858

To page

5869

Abstract

A new architecture of intelligent audio emotion recognition is proposed in this paper. It fully utilizes both prosodic and spectral features in its design. It has two main paths in parallel and can recognize 6 emotions. Path 1 is designed based on intensive analysis of different prosodic features. Significant prosodic features are identified to differentiate emotions. Path 2 is designed based on research analysis on spectral features. Extraction of Mel-Frequency Cepstral Coefficient (MFCC) feature is then followed by Bi-directional Principle Component Analysis (BDPCA), Linear Discriminant Analysis (LDA) and Radial Basis Function (RBF) neural classification. This path has 3 parallel BDPCA + LDA + RBF sub-paths structure and each handles two emotions. Fusion modules are also proposed for weights assignment and decision making. The performance of the proposed architecture is evaluated on eNTERFACE’05 and RML databases. Simulation results and comparison have revealed good performance of the proposed recognizer.

Keywords

prosodic features , MFCC feature , Audio emotion recognition , RBF neural network

Journal title

Expert Systems with Applications

Serial Year

2014

Journal title

Expert Systems with Applications

Record number

A new approach of audio emotion recognition

Ooi، نويسنده , , Chien Shing and Seng، نويسنده , , Kah Phooi and Ang، نويسنده , , Li-Minn and Chew، نويسنده , , Li Wern Chew، نويسنده ,

2355007