Title :
Emotional Speech Recognition Using Acoustic Models of Decomposed Component Words
Author :
Kaveeta, Vivatchai ; Patanukhom, Karn
Author_Institution :
Dept. of Comput. Eng., Chiang Mai Univ., Chiang Mai, Thailand
Abstract :
This paper presents a novel approach for emotional speech recognition. Instead of using a full length of speech for classification, the proposed method decomposes speech signals into component words, groups the words into segments and generates an acoustic model for each segment by using features such as audio power, MFCC, log attack time, spectrum spread and segment duration. Based on the proposed segment-based classification, unknown speech signals can be recognized into sequences of segment emotions. Emotion profiles (EPs) are extracted from the emotion sequences. Finally, speech emotion can be determined by using EP as features. Experiments are conducted by using 6,810 training samples and 722 test samples which are composed of eight emotional classes from IEMOCAP database. In comparison with a conventional method, the proposed method can improve recognition rate from 46.81% to 58.59% in eight emotion classification and from 60.18% to 71.25% in four emotion classification.
Keywords :
acoustic signal processing; audio signal processing; emotion recognition; feature extraction; signal classification; spectral analysis; speech recognition; support vector machines; EP; IEMOCAP database; MFCC; SVM; acoustic models; audio power; decomposed component words; emotion classification; emotion profiles extraction; emotional speech recognition; log attack time; recognition rate; segment duration; segment emotion sequences; segment-based classification; spectrum spread; speech classification; speech signal decomposition; support vector machine; word segmentation; Acoustics; Data models; Emotion recognition; Feature extraction; Speech; Speech recognition; Support vector machines; SVM; emotional classification; speech emotion;
Conference_Titel :
Pattern Recognition (ACPR), 2013 2nd IAPR Asian Conference on
Conference_Location :
Naha
DOI :
10.1109/ACPR.2013.13