• DocumentCode
    1524957
  • Title

    Evaluation of Speaker Verification Security and Detection of HMM-Based Synthetic Speech

  • Author

    De Leon, Phillip L. ; Pucher, Michael ; Yamagishi, Junichi ; Hernaez, Inma ; Saratxaga, Ibon

  • Author_Institution
    Klipsch School of Electrical and Computer Engineering, New Mexico State University (NMSU), Las Cruces, NM, USA
  • Volume
    20
  • Issue
    8
  • fYear
    2012
  • Firstpage
    2280
  • Lastpage
    2290
  • Abstract
    In this paper, we evaluate the vulnerability of speaker verification (SV) systems to synthetic speech. The SV systems are based on either the Gaussian mixture model–universal background model (GMM-UBM) or support vector machine (SVM) using GMM supervectors. We use a hidden Markov model (HMM)-based text-to-speech (TTS) synthesizer, which can synthesize speech for a target speaker using small amounts of training data through model adaptation of an average voice or background model. Although the SV systems have a very low equal error rate (EER), when tested with synthetic speech generated from speaker models derived from the Wall Street Journal (WSJ) speech corpus, over 81% of the matched claims are accepted. This result suggests vulnerability in SV systems and thus a need to accurately detect synthetic speech. We propose a new feature based on relative phase shift (RPS), demonstrate reliable detection of synthetic speech, and show how this classifier can be used to improve security of SV systems.
  • Keywords
    Adaptation models; Hidden Markov models; Speech; Support vector machines; Synthesizers; Training; Vectors; Security; speaker recognition; speech synthesis;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2012.2201472
  • Filename
    6205335