Title :
Deep neural networks for small footprint text-dependent speaker verification
Author :
Variani, Ehsan ; Xin Lei ; McDermott, Erik ; Lopez Moreno, Ignacio ; Gonzalez-Dominguez, Jorge
Author_Institution :
Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-dependent speaker verification task. At development stage, a DNN is trained to classify speakers at the frame-level. During speaker enrollment, the trained DNN is used to extract speaker specific features from the last hidden layer. The average of these speaker features, or d-vector, is taken as the speaker model. At evaluation stage, a d-vector is extracted for each utterance and compared to the enrolled speaker model to make a verification decision. Experimental results show the DNN based speaker verification system achieves good performance compared to a popular i-vector system on a small footprint text-dependent speaker verification task. In addition, the DNN based system is more robust to additive noise and outperforms the i-vector system at low False Rejection operating points. Finally the combined system outperforms the i-vector system by 14% and 25% relative in equal error rate (EER) for clean and noisy conditions respectively.
Keywords :
decision making; feature extraction; neural nets; signal classification; speaker recognition; DNN based system; EER; additive noise; d-vector extraction; deep neural network; equal error rate; false rejection operating point; footprint text dependent speaker verification; i-vector system; speaker classification; speaker enrollment; speaker model; speaker specific feature extraction; verification decision making; Feature extraction; Neural networks; Noise measurement; Speaker recognition; Speech; Training; Vectors; Deep neural networks; speaker verification;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854363