DocumentCode :
1204234
Title :
Energy onset times for speaker identification
Author :
Quatieri, T.F. ; Jankowski, C.R., Jr. ; Reynolds, D.A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Volume :
1
Issue :
11
fYear :
1994
Firstpage :
160
Lastpage :
162
Abstract :
Onset times of resonant energy pulses are measured with the high-resolution Teager operator and used as features in the Reynolds Gaussian-mixture speaker identification algorithm. Feature sets are constructed with primary pitch and secondary pulse locations derived from low and high speech formants. Preliminary testing was performed with a confusable 40-speaker subset from the NTIMIT (telephone channel) database. Speaker identification improved from 55 to 70% correct classification when the full set of new resonant energy-based features were added as an independent stream to conventional mel-cepstra.<>
Keywords :
parameter estimation; speech analysis and processing; speech recognition; Reynolds Gaussian-mixture speaker identification algorithm; classification; confusable 40-speaker subset; high-resolution Teager operator; onset times; primary pitch location; resonant energy pulses; resonant energy-based features; secondary pulse location; speaker identification; speech formants; Amplitude estimation; Chirp modulation; Energy measurement; Fluctuations; Frequency estimation; Performance evaluation; Pulse measurements; Resonance; Signal analysis; Speech;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/97.335062
Filename :
335062
Link To Document :
بازگشت