• DocumentCode
    1856682
  • Title

    Fine structure features for speaker identification

  • Author

    Jankowski, C.R. ; Quatieri, Thomas F. ; Reynolds, Und D A

  • Author_Institution
    Lincoln Lab., MIT, Lexington, MA, USA
  • Volume
    2
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    689
  • Abstract
    The performance of speaker identification (SID) systems can be improved by the addition of the rapidly varying “fine structure” features of formant amplitude and/or frequency modulation and multiple excitation pulses. This paper shows how the estimation of such fine structure features can be improved further by obtaining better estimates of formant frequency locations and uncovering various sources of error in the feature extraction systems. Most female telephone speech showed “spurious” formants, due to distortion in the telephone network. Nevertheless, SID performance was greatest with these spurious formants as formant estimates. A new feature has also been identified which can increase SID performance: cepstral coefficients from noise in the estimated excitation waveform. Finally, statistical tools have been developed to explore the relative importance of features used for SID, with the ultimate goal of uncovering the source of the features that provide SID performance improvement
  • Keywords
    cepstral analysis; feature extraction; frequency estimation; frequency modulation; speaker recognition; statistical analysis; telephone networks; cepstral coefficients; error sources; estimated excitation waveform; feature estimation; feature extraction systems; female telephone speech; fine structure features; formant amplitude; formant frequency locations; frequency modulation; multiple excitation pulses; noise; speaker identification; spurious formants; statistical tools; systems performance; telephone network distortion; Degradation; Electrostatic precipitators; Energy measurement; Frequency estimation; Frequency measurement; Frequency modulation; Linear predictive coding; Pulse measurements; Speech; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.543214
  • Filename
    543214