A fast, sequential decoding algorithm with application to speaker verification

Author

Li, Qi

Author_Institution

Multimedia Commun. Res. Lab., AT&T Bell Labs., Murray Hill, NJ, USA

Volume

2

fYear

1999

fDate

15-19 Mar 1999

Firstpage

605

Abstract

To implement speaker verification (SV) technology for real-world applications with a large user population, the system cost becomes an important issue. One needs a fast algorithm which can support more users in a central telephone switch given the limited hardware, or can reduce the hardware requirement on a wireless handset. Li (see Proc. Int´l. Conf. on Spoken Language Processing, Sydney, 1998) proposed a fast, sequential decoding algorithm for left-to-right HMM. The algorithm is based on a sequential detection scheme which is asymptotically optimal in the sense of detecting a possible change in distribution as reliably and quickly as possible. In this paper, the algorithm is evaluated in a fixed-phrase SV system on a database with 23,578 utterances recorded from 100 speakers. The experimental results show that the decoding speed of the proposed algorithm is about 7 to 10 times faster than the Viterbi algorithm while the accuracy is in an acceptable level. The results indicate that the proposed algorithm can also be applied to speaker identification, utterance verification, audio segmentation, voice/silence detection and many other applications

Keywords

hidden Markov models; sequential decoding; signal detection; speaker recognition; HMM; Viterbi algorithm; asymptotically optimal scheme; audio segmentation; central telephone switch; database; decoding speed; distribution change detection; experimental results; fast sequential decoding algorithm; fixed-phrase SV system; real-world applications; sequential detection; speaker identification; speaker verification; system cost; utterance verification; voice/silence detection; wireless handset; Change detection algorithms; Costs; Decoding; Hardware; Hidden Markov models; Natural languages; Switches; Telephone sets; Telephony; Wireless sensor networks;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on

Conference_Location

Phoenix, AZ

ISSN

1520-6149

Print_ISBN

0-7803-5041-3

Type

conf

DOI

10.1109/ICASSP.1999.759739

Filename

759739