• DocumentCode
    2796633
  • Title

    Revisiting the security of speaker verification systems against imposture using synthetic speech

  • Author

    De Leon, Phillip L. ; Apsingekar, Vijendra Raj ; Pucher, Michael ; Yamagishi, Junichi

  • Author_Institution
    Klipsch Sch. of Electr. & Comp. Eng., New Mexico State Univ., Las Cruces, NM, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    1798
  • Lastpage
    1801
  • Abstract
    In this paper, we investigate imposture using synthetic speech. Although this problem was first examined over a decade ago, dramatic improvements in both speaker verification (SV) and speech synthesis have renewed interest in this problem. We use a HMM-based speech synthesizer which creates synthetic speech for a targeted speaker through adaptation of a background model. We use two SV systems: standard GMM-UBM-based and a newer SVM-based. Our results show when the systems are tested with human speech, there are zero false acceptances and zero false rejections. However, when the systems are tested with synthesized speech, all claims for the targeted speaker are accepted while all other claims are rejected. We propose a two-step process for detection of synthesized speech in order to prevent this imposture. Overall, while SV systems have impressive accuracy, even with the proposed detector, high-quality synthetic speech will lead to an unacceptably high false acceptance rate.
  • Keywords
    Gaussian processes; hidden Markov models; security of data; speaker recognition; speech synthesis; support vector machines; GMM-UBM; Gaussian mixture model; HMM; SVM; hidden Markov model; security; speaker verification system; speech synthesis; support vector machine; synthetic speech; universal background model; zero false acceptance; zero false rejection; Covariance matrix; Hidden Markov models; Kernel; Security; Signal synthesis; Speech synthesis; Support vector machine classification; Support vector machines; Synthesizers; System testing; Security; Speaker recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495413
  • Filename
    5495413