Title :
Augmented phonetic map for voice verification
Author_Institution :
NYNEX Science & Technology Inc., White Plains, NY, USA
Abstract :
A perceptually based model for speaker identity verification (SIV) using derivative of phase spectrum (DPS) as the primary identity-bearing feature to model individual speakers´ vocal tract dynamics is presented. The basic technique used to model a speaker is to create a two-dimensional trajectory of changing vocal tract based on formant movement and pitch information. The map is further augmented with both instantaneous and dynamic feature parameters of DPS as well as with conventional energy-based acoustic features. A series of verification experiments was conducted, using a three-layer artificial neural network as a classifier, with an isolated digit database recorded over 11 different telephone handsets. The preliminary testing results suggest that this system performs significantly better than a baseline system using a standard cepstrum front-end
Keywords :
neural nets; speech recognition; derivative of phase spectrum; energy-based acoustic features; formant movement; isolated digit database; pitch information; speaker identity verification; telephone handsets; three-layer artificial neural network; vocal tract dynamics; voice verification; Cepstral analysis; Cepstrum; Filtering algorithms; Linear predictive coding; Loudspeakers; Robustness; Spatial databases; Speech analysis; Telephony; Time domain analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.226093