• DocumentCode
    2066947
  • Title

    Simplified Deformation Compensation for Emotional Speaker Recognition

  • Author

    Yang, Yingchun ; Wu, Tian ; Lv, Hongbing

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Zhejiang Univ., China
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Emotional speaker recognition has been investigated by a number of researchers; however, all current approaches share a flaw: they require a large amount of emotional speech from each speaker during training, and some even require the user's emotional state during testing, which hinders the commercialization of speaker recognition technology. We propose a method based on a novel view of the MFCC deformation caused by pitch deviation, named pitch deviation-based cepstrum compensation (PDCC), which takes into account the correlation between the glottis and the vocal tract. The method is applied to two emotional speech corpora, EPS and MASC, yielding an absolute increase in identification rate (IR) of 10.1% for the former and 4.12% for the latter, which are promising results.
  • Keywords
    cepstral analysis; correlation methods; emotion recognition; speaker recognition; MFCC deformation; PDCC; emotional speaker recognition; glottis-vocal tract correlation; pitch deviation-based cepstrum compensation; simplified deformation compensation; Automatic speech recognition; Cepstrum; Commercialization; Educational institutions; Histograms; Loudspeakers; Mel frequency cepstral coefficient; Speaker recognition; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2942-4
  • Electronic_ISBN
    978-1-4244-2943-1
  • Type

    conf

  • DOI
    10.1109/CHINSL.2008.ECP.89
  • Filename
    4730343