• DocumentCode
    3049076
  • Title

    A Hybrid Speech Emotion Perception Method of VQ-based Feature Processing and ANN Recognition

  • Author

    Wenjing, Han ; Haifeng, Li ; Chunyu, Guo

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
  • Volume
    2
  • fYear
    2009
  • fDate
    19-21 May 2009
  • Firstpage
    145
  • Lastpage
    149
  • Abstract
    This paper constructs a VQ/ANN (vector quantization/artificial neural network) based speech emotion recognition system. The system first extracts the basic prosodic parameters and Mel-frequency cepstral coefficients (MFCC) frame by frame. Recent researches reveal that MFCC convey detailed emotional relevant information of syllable. However, the statistic measures of MFCC confuse the information at sentence level. Therefore, this paper proposes a VQ-based method different to statistic method to generate measures of MFCC. Then the combination of VQ-based MFCC measures and the statistic measures of prosodic parameters is used as input feature vector. The ANN is performed to process the combination features and the statistic measures of all extracted parameters respectively. The experiment results reveal that the combination features outperform the statistic measures. More detailed analysis indicates that the combination features could characterize the emotion space better than the statistic features. Besides, the rationality of VQ/ANN based framework is also demonstrated.
  • Keywords
    cepstral analysis; emotion recognition; neural nets; speech processing; vector quantisation; ANN recognition; VQ-based feature processing; artificial neural network; mel-frequency cepstral coefficients; speech emotion perception method; vector quantization; Artificial neural networks; Cepstral analysis; Data mining; Emotion recognition; Mel frequency cepstral coefficient; Performance evaluation; Speech processing; Speech recognition; Statistics; Vector quantization; artificial neural network; emtoion features; speech emotion recognition; vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems, 2009. GCIS '09. WRI Global Congress on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-0-7695-3571-5
  • Type

    conf

  • DOI
    10.1109/GCIS.2009.432
  • Filename
    5209400