Title :
Speaker identification with whispered speech using unvoiced-consonant phonemes
Author :
Juan Xu ; Heming Zhao
Author_Institution :
Sch. of Electron. & Inf. Eng., Soochow Univ., Suzhou, China
Abstract :
Whispering is a speech production mode that people use to protect their privacy. Because whispered and neutral speech differ in both excitation and vocal tract function, the performance of speaker identification systems trained on neutral speech degrades significantly when they are applied to whispered speech. This paper describes a closed-set speaker identification system for the mismatched neutral/whisper condition. The acoustic characteristics of vowels and voiced consonants differ between whispered and neutral speech, whereas those of unvoiced consonants are relatively similar. To improve system performance, a feature extraction algorithm based on a linear frequency scale is applied. Static linear frequency cepstral coefficient (LFCC) vectors are extracted as features from neutral and whispered unvoiced consonants. The closed-set speaker ID system using unvoiced consonants with LFCC features achieves an absolute improvement in speaker recognition performance.
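The abstract names static LFCC vectors as the feature but gives no parameters. The following is a minimal sketch of a generic LFCC computation for a single frame, not the authors' implementation. The function names, the Hamming window, the frame length, the number of filters (20) and coefficients (12), and the choice to drop the zeroth coefficient are all illustrative assumptions. The only difference from a mel-scale cepstrum is that the triangular filters are spaced evenly on a linear frequency axis.

```python
import math


def linear_filterbank(n_filters, n_bins):
    """Triangular filters whose centers are equally spaced on a linear axis."""
    # n_filters + 2 equally spaced edge points across bins [0, n_bins - 1]
    edges = [i * (n_bins - 1) / (n_filters + 1) for i in range(n_filters + 2)]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = edges[m - 1], edges[m], edges[m + 1]
        row = []
        for k in range(n_bins):
            if lo < k <= mid:          # rising slope
                row.append((k - lo) / (mid - lo))
            elif mid < k < hi:         # falling slope
                row.append((hi - k) / (hi - mid))
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def power_spectrum(frame):
    """One-sided power spectrum via a direct DFT (slow, but dependency-free)."""
    n = len(frame)
    # Hamming window (an assumption; the abstract does not name a window)
    win = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
           for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(win))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(win))
        spec.append((re * re + im * im) / n)
    return spec


def lfcc(frame, n_filters=20, n_ceps=12):
    """Static LFCC vector for one frame (hypothetical default parameters)."""
    spec = power_spectrum(frame)
    bank = linear_filterbank(n_filters, len(spec))
    # Log filterbank energies; the small constant guards against log(0)
    log_e = [math.log(sum(w * p for w, p in zip(row, spec)) + 1e-10)
             for row in bank]
    # DCT-II of the log energies, skipping the zeroth (energy) coefficient
    return [sum(e * math.cos(math.pi * c * (m + 0.5) / n_filters)
                for m, e in enumerate(log_e))
            for c in range(1, n_ceps + 1)]
```

Under these assumptions, calling `lfcc(frame)` on a 128-sample frame taken from an unvoiced consonant returns a 12-dimensional static feature vector. In practice an FFT routine from a numerical library would replace the direct DFT.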
Keywords :
acoustic signal processing; feature extraction; speaker recognition; acoustic characteristics; closed-set speaker ID system; excitation function; feature extraction algorithm; linear frequency scale; neutral speech; speaker identification; speech production; static linear frequency cepstral coefficient vector; unvoiced-consonant phoneme; vocal tract function; voiced consonant; vowel; whispered speech; Adaptation models; Cepstral analysis; Speech; Speech recognition; Training; consonants; linear frequency cepstral coefficient (LFCC); unvoiced
Conference_Title :
Image Analysis and Signal Processing (IASP), 2012 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-2547-9
DOI :
10.1109/IASP.2012.6425009