DocumentCode
3488438
Title
Vector quantization techniques for GMM based speaker verification
Author
Singh, Gurmeet ; Panda, Ashish ; Bhattacharyya, Souvik ; Srikanthan, Thambipillai
Author_Institution
Indian Inst. of Technol., Kanpur, India
Volume
2
fYear
2003
fDate
6-10 April 2003
Abstract
This paper explores the novel application of two vector quantization algorithms, namely Linde, Buzo, Gray (1980) and K-means algorithm for efficient speaker verification. Automatic speaker verification (ASV) is a memory and compute intensive process, giving rise to area and latency concerns in the way of its implementation for real-time efficient embedded systems. The training schemes for computing the speaker models, such as the expectation maximization are highly iterative and contribute significantly to the overall complexity in the implementation of the system. We demonstrate the use of the LBG and the K-means algorithm to realize compute efficient training method. Models trained with the LBG algorithm achieves as much as 99.88% of EM accuracy, whilst K-means achieves as much as 99.91% of EM accuracy. Moreover, the EM computational complexity is almost twice that of LBG or K-means. Thus, using LBG and K-means algorithms for training Gaussian mixture speaker models for text-independent speaker verification, we show that, that they deliver comparable performance as the EM algorithm at significantly reduced computational complexity. Thus making them an ideal choice for low-cost applications.
Keywords
Gaussian processes; computational complexity; embedded systems; optimisation; speaker recognition; speech coding; vector quantisation; EM accuracy; EM computational complexity; GMM based speaker verification; Gaussian mixture model; Gaussian mixture speaker models training; K-means algorithm; LBG algorithm; Linde Buzo Gray algorithm; automatic speaker verification; efficient training method; expectation maximization; latency; real-time efficient embedded systems; speaker models; text-independent speaker verification; training schemes; vector quantization; Biometrics; Computational complexity; Convergence; Costs; Embedded computing; Embedded system; High performance computing; Iterative algorithms; Testing; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1202295
Filename
1202295
Link To Document