DocumentCode
2182847
Title
Forensic voice comparison with secular shibboleths - A hybrid fused gmm-multivariate likelihood ratio-based approach using alveolo-palatal fricative cepstral spectra
Author
Rose, Phil
Author_Institution
School of Language Studies, Australian National University, Australia
fYear
2011
fDate
22-27 May 2011
Firstpage
5900
Lastpage
5903
Abstract
The suitability of voiceless fricative spectra for forensic voice comparison is explored within a Likelihood Ratio-based framework. Non-contemporaneous landline telephone recordings of 99 male Japanese speakers are compared using only tokens of their voiceless alveolo-patalal fricative [ç]. A subset of mean-cepstrally-subtracted LPC CCs from the fricative spectrum from dc to 5 kHz is used. GMM/UBM and multivariate likelihood ratios are extracted for the 99 target and 4851 non-target trials, and fused with logistic regression. An EER of 7.4% and log-LR cost of 0.26 is demonstrated. It is concluded that the [ç] spectrum does have some individualising potential.
Keywords
maximum likelihood estimation; speaker recognition; LPC CC; UBM; alveolo-palatal fricative cepstral spectra; forensic voice comparison; fricative spectrum; fused GMM-multivariate likelihood ratio-based approach; noncontemporaneous landline telephone recordings; secular shibboleths; Cavity resonators; Cepstral analysis; Forensics; Speaker recognition; Speech; Tongue; Forensic Voice Comparison; GMM/UBM; Multivariate Likelihood Ratio; cepstrum; coronal fricative spectra;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947704
Filename
5947704
Link To Document