• DocumentCode
    15402
  • Title

    Deep Neural Network Approaches to Speaker and Language Recognition

  • Author

    Richardson, Fred ; Reynolds, Douglas ; Dehak, Najim

  • Author_Institution
    Lincoln Lab., MIT, Lexington, MA, USA
  • Volume
    22
  • Issue
    10
  • fYear
    2015
  • fDate
    Oct. 2015
  • Firstpage
    1671
  • Lastpage
    1675
  • Abstract
    The impressive gains in performance obtained using deep neural networks (DNNs) for automatic speech recognition (ASR) have motivated the application of DNNs to other speech technologies such as speaker recognition (SR) and language recognition (LR). Prior work has shown performance gains for separate SR and LR tasks using DNNs for direct classification or for feature extraction. In this work we present the application of single DNN for both SR and LR using the 2013 Domain Adaptation Challenge speaker recognition (DAC13) and the NIST 2011 language recognition evaluation (LRE11) benchmarks. Using a single DNN trained for ASR on Switchboard data we demonstrate large gains on performance in both benchmarks: a 55% reduction in EER for the DAC13 out-of-domain condition and a 48% reduction in Cavg on the LRE11 30 s test condition. It is also shown that further gains are possible using score or feature fusion leading to the possibility of a single i-vector extractor producing state-of-the-art SR and LR performance.
  • Keywords
    feature extraction; neural nets; speaker recognition; ASR; DAC13; DNN; LR; LRE11; automatic speech recognition; deep neural network approaches; direct classification; domain adaptation challenge speaker recognition; feature extraction; language recognition; language recognition evaluation; speaker recognition; speech technologies; switchboard data; Feature extraction; Mel frequency cepstral coefficient; Neural networks; Speaker recognition; Speech; Speech recognition; Training; Bottleneck features; DNN; i-vector; language recognition; senone posteriors; speaker recognition; tandem features;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2015.2420092
  • Filename
    7080838