DocumentCode
2066947
Title
Simplified Deformation Compensation for Emotional Speaker Recognition
Author
Yang, Yingchun ; Wu, Tian ; Lv, Hongbing
Author_Institution
Coll. of Comput. Sci. & Technol., Zhejiang Univ., China
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
Emotional speaker recognition has been investigated by a number of researchers, however, all the current approaches had flaws in the requirement of a large amount of emotional speech from speakers during training and even the emotional state of a user during testing, which hinder the commercialization of speaker recognition technology. We propose our method from novel view of MFCC deformation caused by pitch deviation, named pitch deviation-based cepstrum compensation (PDCC), which take into account the correlation between glottis and vocal tract. Our method is applied to two emotional speech corpus EPS and MASC with absolute IR (identification rate) increase by 10.1% for the former and 4.12% for the latter, which are promising results .
Keywords
cepstral analysis; correlation methods; emotion recognition; speaker recognition; MFCC deformation; PDCC; emotional speaker recognition; glottis-vocal tract correlation; pitch deviation-based cepstrum compensation; simplified deformation compensation; Automatic speech recognition; Cepstrum; Commercialization; Educational institutions; Histograms; Loudspeakers; Mel frequency cepstral coefficient; Speaker recognition; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.89
Filename
4730343
Link To Document