• DocumentCode
    1224345
  • Title

    A Syllable Lattice Approach to Speaker Verification

  • Author

    Jin, Minho ; Soong, Frank K. ; Yoo, Chang D.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Korea Adv. Inst. of Sci. & Technol., Daejeon
  • Volume
    15
  • Issue
    8
  • fYear
    2007
  • Firstpage
    2476
  • Lastpage
    2484
  • Abstract
    This paper proposes a syllable-lattice-based speaker verification algorithm for Mandarin Chinese input. For each speech utterance, a syllable lattice is generated with a speaker-independent large-vocabulary continuous speech recognition system in free syllable decoding. The verification decision is made based upon the likelihood ratio between a target-speaker model and a speaker-independent background model, computed on the decoded syllable lattice. The likelihood function is calculated efficiently in a forward algorithm by considering all paths in the lattice. The proposed algorithm was evaluated using a Mandarin Chinese database, where 1832 true and 26 250 impostor trials were recorded by 19 target speakers and 180 impostors. The average duration of each trial is 2 s long without silence. The target-speaker model was adapted from the speaker-independent background model using enrollment data of two minutes with silence. The proposed algorithm achieved an equal-error rate of 0.857% which is better than 1.21% of the hidden Markov model-based speaker verification algorithm without using syllable lattices. The equal-error rate was further reduced to 0.617% by incorporating the Goussian mixture model-universal background model algorithm with 2048 Gaussian kernels whose equal error rate is 0.990%.
  • Keywords
    Gaussian processes; hidden Markov models; natural language processing; speaker recognition; speech coding; Gaussian mixture model; Mandarin Chinese input; continuous speech recognition system; free syllable decoding; hidden Markov model; likelihood function; speaker verification algorithm; speaker-independent background model; speech utterance; syllable lattice approach; target-speaker model; Databases; Decoding; Error analysis; Hidden Markov models; Kernel; Lattices; Loudspeakers; Speaker recognition; Speech recognition; Testing; Lattice-based speaker adaptation; Mandarin Chinese; lattice rescoring; speaker recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.906181
  • Filename
    4317565