DocumentCode :
284778
Title :
Augmented phonetic map for voice verification
Author :
Chang, Harry M.
Author_Institution :
NYNEX Science & Technology Inc., White Plains, NY, USA
Volume :
2
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
169
Abstract :
A perceptually based model for speaker identity verification (SIV) using derivative of phase spectrum (DPS) as the primary identity-bearing feature to model individual speakers´ vocal tract dynamics is presented. The basic technique used to model a speaker is to create a two-dimensional trajectory of changing vocal tract based on formant movement and pitch information. The map is further augmented with both instantaneous and dynamic feature parameters of DPS as well as with conventional energy-based acoustic features. A series of verification experiments was conducted, using a three-layer artificial neural network as a classifier, with an isolated digit database recorded over 11 different telephone handsets. The preliminary testing results suggest that this system performs significantly better than a baseline system using a standard cepstrum front-end
Keywords :
neural nets; speech recognition; derivative of phase spectrum; energy-based acoustic features; formant movement; isolated digit database; pitch information; speaker identity verification; telephone handsets; three-layer artificial neural network; vocal tract dynamics; voice verification; Cepstral analysis; Cepstrum; Filtering algorithms; Linear predictive coding; Loudspeakers; Robustness; Spatial databases; Speech analysis; Telephony; Time domain analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.226093
Filename :
226093
Link To Document :
بازگشت