• DocumentCode
    3071912
  • Title

    Application of neural networks in emotional speech recognition

  • Author

    Bojanic, Milana ; Crnojevic, Vladimir ; Delic, Vlado

  • Author_Institution
    Fac. of Tech. Sci., Univ. of Novi Sad, Novi Sad, Serbia
  • fYear
    2012
  • fDate
    20-22 Sept. 2012
  • Firstpage
    223
  • Lastpage
    226
  • Abstract
    Emotional speech recognition (ESR) from the aspect of human-machine interaction (HCI) is a prerequisite for the framework of interacting partners within the HCI. This paper addresses the application of neural network (NN) in ESR. The performance of NN is tested using three different feature sets which are basis for ESR: prosodic features, spectral features and a set of their combination. The results of these feature sets are compared using several network topologies and two training algorithms. It has been shown that using joint prosodic-spectral feature set as input to three layer feed-forward NN trained with back-propagation algorithm has the best performance in 5-class emotional speech recognition task.
  • Keywords
    backpropagation; emotion recognition; human computer interaction; neural nets; speech recognition; ESR; HCI; NN; backpropagation algorithm; emotional speech recognition; human-machine interaction; network topologies; neural network application; prosodic features; spectral features; Accuracy; Emotion recognition; Feature extraction; Network topology; Neurons; Speech; Speech recognition; emotional speech recognition; neural network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Network Applications in Electrical Engineering (NEUREL), 2012 11th Symposium on
  • Conference_Location
    Belgrade
  • Print_ISBN
    978-1-4673-1569-2
  • Type

    conf

  • DOI
    10.1109/NEUREL.2012.6420016
  • Filename
    6420016