DocumentCode
3426091
Title
Voice source cepstrum coefficients for speaker identification
Author
Gudnason, Jon ; Brookes, Mike
Author_Institution
Dept. of Electr. & Electron. Eng., Imperial Coll. London, London
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4821
Lastpage
4824
Abstract
We propose a novel feature set for speaker recognition that is based on the voice source signal. The feature extraction process uses closed-phase LPC analysis to estimate the vocal tract transfer function. The LPC spectrum envelope is converted to cepstrum coefficients which are used to derive the voice source features. Unlike approaches based on inverse-filtering, our procedure is robust to LPC analysis errors and low-frequency phase distortion. We have performed text-independent closed-set speaker identification experiments on the TIMIT and the YOHO databases using a standard Gaussian mixture model technique. Compared to using mel- frequency cepstrum coefficients, the misclassification rate for the TIMIT database reduced from 1.51% to 0.16% when combined with the proposed voice source features. For the YOHO database the mis- classification rate decreased from 13.79% to 10.07%. The new feature vector also compares favourably to other proposed voice source feature sets.
Keywords
cepstral analysis; feature extraction; speaker recognition; Gaussian mixture model; TIMIT database; YOHO database; closed-phase LPC spectrum analysis; feature extraction; low-frequency phase distortion; speaker identification; speaker recognition; vocal tract transfer function estimation; voice source cepstrum coefficients; Cepstral analysis; Cepstrum; Error analysis; Feature extraction; Linear predictive coding; Phase distortion; Robustness; Spatial databases; Speaker recognition; Transfer functions; Cepstral Analysis; Speaker Recognition; Speech Analysis; Vocal Systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518736
Filename
4518736
Link To Document