DocumentCode
2329044
Title
Robust prosodic features for speaker identification
Author
Carey, Michael J. ; Parris, Eluned S. ; Lloyd-Thomas, Harvey ; Bennett, Stephen
Author_Institution
Ensigma Ltd., Gwent, UK
Volume
3
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1800
Abstract
The paper describes the use of prosodic features for speaker identification. Features based on the pitch and energy contours of speech are described and the relative importance of each feature for speaker identification is investigated. The mean and variance of the pitch period in voiced sections of speech are shown to be particularly useful at discriminating between speakers. Fusing these features with a hidden Markov model speaker identification system gave a marked improvement in figure of merit; over 30% gain was achieved on the six NIST 1995 evaluation tests presented. Handset variability is known to have an adverse effect on performance when traditional spectral features are used, e.g. cepstra. Results are presented showing that the prosodic features are more robust to handset variability
Keywords
cepstral analysis; hidden Markov models; speaker recognition; NIST 1995 evaluation tests; cepstra; energy contours; handset variability; hidden Markov model speaker identification system; pitch contours; pitch period; robust prosodic features; speaker identification; spectral features; voiced sections; Digital signal processors; Hidden Markov models; NIST; Robustness; Signal processing algorithms; Speaker recognition; Speech coding; System testing; Telephone sets; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607979
Filename
607979
Link To Document