• DocumentCode
    337443
  • Title

    A fast, sequential decoding algorithm with application to speaker verification

  • Author

    Li, Qi

  • Author_Institution
    Multimedia Commun. Res. Lab., AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    605
  • Abstract
    To implement speaker verification (SV) technology for real-world applications with a large user population, the system cost becomes an important issue. One needs a fast algorithm which can support more users in a central telephone switch given the limited hardware, or can reduce the hardware requirement on a wireless handset. Li (see Proc. Int´l. Conf. on Spoken Language Processing, Sydney, 1998) proposed a fast, sequential decoding algorithm for left-to-right HMM. The algorithm is based on a sequential detection scheme which is asymptotically optimal in the sense of detecting a possible change in distribution as reliably and quickly as possible. In this paper, the algorithm is evaluated in a fixed-phrase SV system on a database with 23,578 utterances recorded from 100 speakers. The experimental results show that the decoding speed of the proposed algorithm is about 7 to 10 times faster than the Viterbi algorithm while the accuracy is in an acceptable level. The results indicate that the proposed algorithm can also be applied to speaker identification, utterance verification, audio segmentation, voice/silence detection and many other applications
  • Keywords
    hidden Markov models; sequential decoding; signal detection; speaker recognition; HMM; Viterbi algorithm; asymptotically optimal scheme; audio segmentation; central telephone switch; database; decoding speed; distribution change detection; experimental results; fast sequential decoding algorithm; fixed-phrase SV system; real-world applications; sequential detection; speaker identification; speaker verification; system cost; utterance verification; voice/silence detection; wireless handset; Change detection algorithms; Costs; Decoding; Hardware; Hidden Markov models; Natural languages; Switches; Telephone sets; Telephony; Wireless sensor networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759739
  • Filename
    759739