DocumentCode :
15402
Title :
Deep Neural Network Approaches to Speaker and Language Recognition
Author :
Richardson, Fred ; Reynolds, Douglas ; Dehak, Najim
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Volume :
22
Issue :
10
fYear :
2015
fDate :
Oct. 2015
Firstpage :
1671
Lastpage :
1675
Abstract :
The impressive gains in performance obtained using deep neural networks (DNNs) for automatic speech recognition (ASR) have motivated the application of DNNs to other speech technologies such as speaker recognition (SR) and language recognition (LR). Prior work has shown performance gains for separate SR and LR tasks using DNNs for direct classification or for feature extraction. In this work we present the application of single DNN for both SR and LR using the 2013 Domain Adaptation Challenge speaker recognition (DAC13) and the NIST 2011 language recognition evaluation (LRE11) benchmarks. Using a single DNN trained for ASR on Switchboard data we demonstrate large gains on performance in both benchmarks: a 55% reduction in EER for the DAC13 out-of-domain condition and a 48% reduction in Cavg on the LRE11 30 s test condition. It is also shown that further gains are possible using score or feature fusion leading to the possibility of a single i-vector extractor producing state-of-the-art SR and LR performance.
Keywords :
feature extraction; neural nets; speaker recognition; ASR; DAC13; DNN; LR; LRE11; automatic speech recognition; deep neural network approaches; direct classification; domain adaptation challenge speaker recognition; feature extraction; language recognition; language recognition evaluation; speaker recognition; speech technologies; switchboard data; Feature extraction; Mel frequency cepstral coefficient; Neural networks; Speaker recognition; Speech; Speech recognition; Training; Bottleneck features; DNN; i-vector; language recognition; senone posteriors; speaker recognition; tandem features;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2015.2420092
Filename :
7080838
Link To Document :
بازگشت