Sinusoidal model-based analysis and classification of stressed speech

Author

Ramamohan, S. ; Dandapat, S.

Author_Institution

Intel Technol. India Pvt. Ltd., Bangalore, India

Volume

14

Issue

3

fYear

2006

fDate

5/1/2006 12:00:00 AM

Firstpage

737

Lastpage

746

Abstract

In this paper, a sinusoidal model has been proposed for characterization and classification of different stress classes (emotions) in a speech signal. Frequency, amplitude and phase features of the sinusoidal model are analyzed and used as input features to a stressed speech recognition system. The performances of sinusoidal model features are evaluated for recognition of different stress classes with a vector-quantization classifier and a hidden Markov model classifier. To find the effectiveness of these features for recognition of different emotions in different languages, speech signals are recorded and tested in two languages, Telugu (an Indian language) and English. Average stressed speech index values are proposed for comparing differences between stress classes in a speech signal. Results show that sinusoidal model features are successful in characterizing different stress classes in a speech signal. Sinusoidal features perform better compared to the linear prediction and cepstral features in recognizing the emotions in a speech signal.

Keywords

hidden Markov models; speech coding; speech recognition; vector quantisation; amplitude features; hidden Markov model classifier; phase features; sinusoidal model-based analysis; stressed speech classification; stressed speech recognition system; vector-quantization classifier; Cepstral analysis; Emotion recognition; Frequency; Hidden Markov models; Natural languages; Performance evaluation; Speech analysis; Speech recognition; Stress; Testing; Emotion characterization; sinusoidal model; speech recognition; stressed speech; stressed speech index;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

ieee

ISSN

1558-7916

Type

jour

DOI

10.1109/TSA.2005.858071

Filename

1621189