Title :
Energy onset times for speaker identification
Author :
Quatieri, T.F. ; Jankowski, C.R., Jr. ; Reynolds, D.A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
Onset times of resonant energy pulses are measured with the high-resolution Teager operator and used as features in the Reynolds Gaussian-mixture speaker identification algorithm. Feature sets are constructed with primary pitch and secondary pulse locations derived from low and high speech formants. Preliminary testing was performed with a confusable 40-speaker subset from the NTIMIT (telephone channel) database. Speaker identification improved from 55 to 70% correct classification when the full set of new resonant energy-based features were added as an independent stream to conventional mel-cepstra.<>
Keywords :
parameter estimation; speech analysis and processing; speech recognition; Reynolds Gaussian-mixture speaker identification algorithm; classification; confusable 40-speaker subset; high-resolution Teager operator; onset times; primary pitch location; resonant energy pulses; resonant energy-based features; secondary pulse location; speaker identification; speech formants; Amplitude estimation; Chirp modulation; Energy measurement; Fluctuations; Frequency estimation; Performance evaluation; Pulse measurements; Resonance; Signal analysis; Speech;
Journal_Title :
Signal Processing Letters, IEEE