DocumentCode
1856682
Title
Fine structure features for speaker identification
Author
Jankowski, C.R. ; Quatieri, Thomas F. ; Reynolds, Und D A
Author_Institution
Lincoln Lab., MIT, Lexington, MA, USA
Volume
2
fYear
1996
fDate
7-10 May 1996
Firstpage
689
Abstract
The performance of speaker identification (SID) systems can be improved by the addition of the rapidly varying “fine structure” features of formant amplitude and/or frequency modulation and multiple excitation pulses. This paper shows how the estimation of such fine structure features can be improved further by obtaining better estimates of formant frequency locations and uncovering various sources of error in the feature extraction systems. Most female telephone speech showed “spurious” formants, due to distortion in the telephone network. Nevertheless, SID performance was greatest with these spurious formants as formant estimates. A new feature has also been identified which can increase SID performance: cepstral coefficients from noise in the estimated excitation waveform. Finally, statistical tools have been developed to explore the relative importance of features used for SID, with the ultimate goal of uncovering the source of the features that provide SID performance improvement
Keywords
cepstral analysis; feature extraction; frequency estimation; frequency modulation; speaker recognition; statistical analysis; telephone networks; cepstral coefficients; error sources; estimated excitation waveform; feature estimation; feature extraction systems; female telephone speech; fine structure features; formant amplitude; formant frequency locations; frequency modulation; multiple excitation pulses; noise; speaker identification; spurious formants; statistical tools; systems performance; telephone network distortion; Degradation; Electrostatic precipitators; Energy measurement; Frequency estimation; Frequency measurement; Frequency modulation; Linear predictive coding; Pulse measurements; Speech; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.543214
Filename
543214
Link To Document