• DocumentCode
    2182847
  • Title

    Forensic voice comparison with secular shibboleths - A hybrid fused gmm-multivariate likelihood ratio-based approach using alveolo-palatal fricative cepstral spectra

  • Author

    Rose, Phil

  • Author_Institution
    School of Language Studies, Australian National University, Australia
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5900
  • Lastpage
    5903
  • Abstract
    The suitability of voiceless fricative spectra for forensic voice comparison is explored within a Likelihood Ratio-based framework. Non-contemporaneous landline telephone recordings of 99 male Japanese speakers are compared using only tokens of their voiceless alveolo-patalal fricative [ç]. A subset of mean-cepstrally-subtracted LPC CCs from the fricative spectrum from dc to 5 kHz is used. GMM/UBM and multivariate likelihood ratios are extracted for the 99 target and 4851 non-target trials, and fused with logistic regression. An EER of 7.4% and log-LR cost of 0.26 is demonstrated. It is concluded that the [ç] spectrum does have some individualising potential.
  • Keywords
    maximum likelihood estimation; speaker recognition; LPC CC; UBM; alveolo-palatal fricative cepstral spectra; forensic voice comparison; fricative spectrum; fused GMM-multivariate likelihood ratio-based approach; noncontemporaneous landline telephone recordings; secular shibboleths; Cavity resonators; Cepstral analysis; Forensics; Speaker recognition; Speech; Tongue; Forensic Voice Comparison; GMM/UBM; Multivariate Likelihood Ratio; cepstrum; coronal fricative spectra;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947704
  • Filename
    5947704