DocumentCode :
1950143
Title :
Stacked bottleneck features for speaker verification
Author :
Yao Tian ; Liang He ; Jia Liu
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2015
fDate :
12-15 July 2015
Firstpage :
514
Lastpage :
518
Abstract :
i-Vector modeling has shown to be effective for text independent speaker verification. It represents each utterance as a low-dimensional vector using factor analysis with a GMM supervector. In order to capture more complex speaker statistics, this paper proposes a new feature representation other than i-vectors for speaker verification using neural networks. In this work, stacked bottleneck features are extracted from cascade neural networks based on GMM supervectors. Dropout is integrated into the model to improve generalization error. We compare the proposed method with i-vector approach on NIST SRE2008 female short2-short3 telephone-telephone task. Experimental results demonstrate the efficacy of the proposed method.
Keywords :
Gaussian processes; feature extraction; mixture models; neural nets; signal representation; speaker recognition; text analysis; vectors; GMM supervector; Gaussian mixture model; NIST SRE2008 female short2-short3 telephone-telephone task; factor analysis; feature representation; generalization error improvement; i-vector modeling; low-dimensional vector; neural networks; speaker statistics; stacked bottleneck feature extraction; text independent speaker verification; Feature extraction; NIST; Neural networks; Speech; Speech recognition; Training; Training data; GMM supervector; bottleneck feature; deep neural network; speaker verification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location :
Chengdu
Type :
conf
DOI :
10.1109/ChinaSIP.2015.7230456
Filename :
7230456
Link To Document :
بازگشت