• DocumentCode
    1184326
  • Title

    New LP-derived features for speaker identification

  • Author

    Assaleh, Khaled T. ; Mammone, Richard J.

  • Author_Institution
    CAIP Center, Rutgers Univ., Piscataway, NJ, USA
  • Volume
    2
  • Issue
    4
  • fYear
    1994
  • fDate
    10/1/1994 12:00:00 AM
  • Firstpage
    630
  • Lastpage
    638
  • Abstract
    A new set of features is introduced that has been found to improve the performance of automatic speaker identification systems, The new set of features is referred to as the adaptive component weighting (ACW) cepstral coefficients. The new features emphasize the formant structure of the speech spectrum while attenuating the broad-bandwidth spectral components. The attenuated components correspond to the variations in spectral tilt of transmission and recording environment, and other characteristics that are irrelevant to speaker identification. The resulting ACW spectrum introduces zeros into the usual all-pole linear prediction (LP) spectrum. This is equivalent to applying a finite impulse response (FIR) filter that normalizes the narrow-band modes of the spectrum. Unlike existing fixed cepstral weighting schemes, the ACW cepstrum provides an adaptively weighted version of the LP cepstrum. The adaptation results in deemphasizing the irrelevant variations of the LP cepstral coefficients on a frame-by-frame basis. The ACW features are evaluated for text-independent speaker identification and are shown to yield improved performance
  • Keywords
    digital filters; filtering and prediction theory; spectral analysis; speech analysis and processing; speech recognition; FIR filter; adaptive component weighting; all-pole linear prediction spectrum; attenuated components; automatic speaker identification systems; broad-bandwidth spectral components; cepstral coefficients; cepstral features processing; finite impulse response filter; formant structure; narrow-band modes; recording environment; spectral tilt; speech spectrum; system performance; text-independent speaker identification; transmission environment; Cepstral analysis; Cepstrum; Feature extraction; Filter bank; Finite impulse response filter; Predictive models; Speaker recognition; Spectral analysis; Speech analysis; Transfer functions;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.326621
  • Filename
    326621