Title :
Measuring fine structure in speech: application to speaker identification
Author :
Jankowski, C.R., Jr. ; Quatieri, T.F. ; Reynolds, D.A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
The performance of speaker identification (SID) systems can be quite good with clean speech but is much lower with degraded speech. It is therefore useful to search for new features for SID, particularly features that are robust over a degraded channel. This paper investigates features based on amplitude and frequency modulations of speech formants, high-resolution measurement of fundamental frequency, and location of “secondary pulses”, measured using a high-resolution energy operator. When these features are added to traditional features in an existing SID system and evaluated on a 168-speaker telephone speech database, SID performance improved by as much as 4% for male speakers and 8.2% for female speakers.
Keywords :
amplitude modulation; cepstral analysis; frequency modulation; jitter; speaker recognition; clean speech; degraded channel; degraded speech; female speakers; fine structure measurement; fundamental frequency; high-resolution energy operator; male speakers; performance; secondary pulses; speaker identification; speech; telephone speech database; Degradation; Energy measurement; Energy resolution; Frequency measurement; Pulse measurements; Pulse modulation; Robustness; Telephony
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479539