• DocumentCode
    669211
  • Title

    Forensic evidence reporting using GMM-UBM, JFA and I-vector methods: Application to Algerian Arabic dialect

  • Author

    Boulkenafet, Z. ; Bengherabi, Messaoud ; Harizi, Farid ; Nouali, Omar ; Mohamed, Cheriet

  • Author_Institution
    Centre de Dev. des Technol. Av., Algeria
  • fYear
    2013
  • fDate
    4-6 Sept. 2013
  • Firstpage
    404
  • Lastpage
    409
  • Abstract
    Nowadays, under controlled conditions the speaker verification systems based on the GMM-UBM paradigm show very good performance. However, in forensic investigation activities the conditions; in which recordings are acquired; are uncontrollable, a naive use of the baseline GMM-UBM system without feature normalization, model transformation and score normalization techniques yields to unreliable forensic reporting. In this paper, we investigate forensic reporting using corpus-based likelihood ratio evaluation; which gained popularity in recent years; using two state-of-the-art speaker recognition systems: The JFA system which models explicitly the speaker and session variability during training stage and the I-vector paradigm which models the total variability and use compensation techniques to handle session mismatch. The GMM-UBM, Joint Factor Analysis and I-vector systems are compared in verification performance using Half Total Error Rates (HTER) and in forensic reporting using TIPPET plots. Experimental results on an Algerian Arabic dialect under different telephonic recording conditions confirm the robustness of I-vector and JFA systems in handling cross-channel mismatch and highlight clearly the drastic deterioration of the performance of the GMM-UBM system.
  • Keywords
    digital forensics; natural language processing; speaker recognition; Algerian Arabic dialect; GMM-UBM paradigm; HTER; I-vector methods; JFA; TIPPET plots; compensation techniques; corpus-based likelihood ratio evaluation; cross-channel mismatch; forensic evidence reporting; half total error rates; joint factor analysis; session mismatch; session variability; speaker recognition system; speaker variability; speaker verification systems; telephonic recording conditions; unreliable forensic reporting; Databases; Feature extraction; Forensics; GSM; Signal processing; Speaker recognition; Speech; GMM-UBM; I-vector; Joint Factor Analysis; evidence reporting; forensics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing and Analysis (ISPA), 2013 8th International Symposium on
  • Conference_Location
    Trieste
  • Type

    conf

  • DOI
    10.1109/ISPA.2013.6703775
  • Filename
    6703775