• DocumentCode
    2329044
  • Title

    Robust prosodic features for speaker identification

  • Author

    Carey, Michael J. ; Parris, Eluned S. ; Lloyd-Thomas, Harvey ; Bennett, Stephen

  • Author_Institution
    Ensigma Ltd., Gwent, UK
  • Volume
    3
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1800
  • Abstract
    The paper describes the use of prosodic features for speaker identification. Features based on the pitch and energy contours of speech are described and the relative importance of each feature for speaker identification is investigated. The mean and variance of the pitch period in voiced sections of speech are shown to be particularly useful at discriminating between speakers. Fusing these features with a hidden Markov model speaker identification system gave a marked improvement in figure of merit; over 30% gain was achieved on the six NIST 1995 evaluation tests presented. Handset variability is known to have an adverse effect on performance when traditional spectral features are used, e.g. cepstra. Results are presented showing that the prosodic features are more robust to handset variability
  • Keywords
    cepstral analysis; hidden Markov models; speaker recognition; NIST 1995 evaluation tests; cepstra; energy contours; handset variability; hidden Markov model speaker identification system; pitch contours; pitch period; robust prosodic features; speaker identification; spectral features; voiced sections; Digital signal processors; Hidden Markov models; NIST; Robustness; Signal processing algorithms; Speaker recognition; Speech coding; System testing; Telephone sets; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607979
  • Filename
    607979