• DocumentCode
    900182
  • Title

    Sinusoidal model-based analysis and classification of stressed speech

  • Author

    Ramamohan, S. ; Dandapat, S.

  • Author_Institution
    Intel Technol. India Pvt. Ltd., Bangalore, India
  • Volume
    14
  • Issue
    3
  • fYear
    2006
  • fDate
    5/1/2006 12:00:00 AM
  • Firstpage
    737
  • Lastpage
    746
  • Abstract
    In this paper, a sinusoidal model has been proposed for characterization and classification of different stress classes (emotions) in a speech signal. Frequency, amplitude and phase features of the sinusoidal model are analyzed and used as input features to a stressed speech recognition system. The performances of sinusoidal model features are evaluated for recognition of different stress classes with a vector-quantization classifier and a hidden Markov model classifier. To find the effectiveness of these features for recognition of different emotions in different languages, speech signals are recorded and tested in two languages, Telugu (an Indian language) and English. Average stressed speech index values are proposed for comparing differences between stress classes in a speech signal. Results show that sinusoidal model features are successful in characterizing different stress classes in a speech signal. Sinusoidal features perform better compared to the linear prediction and cepstral features in recognizing the emotions in a speech signal.
  • Keywords
    hidden Markov models; speech coding; speech recognition; vector quantisation; amplitude features; hidden Markov model classifier; phase features; sinusoidal model-based analysis; stressed speech classification; stressed speech recognition system; vector-quantization classifier; Cepstral analysis; Emotion recognition; Frequency; Hidden Markov models; Natural languages; Performance evaluation; Speech analysis; Speech recognition; Stress; Testing; Emotion characterization; sinusoidal model; speech recognition; stressed speech; stressed speech index;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.858071
  • Filename
    1621189