• DocumentCode
    1787060
  • Title

    Proposing two speaker adaptaion methods for deep neural network based speech recognition systems

  • Author

    Ansari, Zohreh ; Salehi, Seyyed Ali Seyyed

  • Author_Institution
    Biomed. Eng. Dept., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2014
  • fDate
    9-11 Sept. 2014
  • Firstpage
    452
  • Lastpage
    457
  • Abstract
    Many researches have done to develop speech recognition systems in the past decades. However, their performance in speaker variabilities lags behind that of human recognition system. In order to solve this problem, speaker adaptation methods have proposed. These methods adapt either the acoustic model parameters or the input features of the speech recognition systems to improve their performance. In this article, two speaker adaptation methods for deep neural network based speech recognition systems are proposed. In the first method, feature vectors of each speaker are adapted nonlinearly after some forward-backward iterations. In the other one, the speech recognition system is modified in order to be able to adapt dynamically in speaker variabilities. This method, unlike other model adaptation methods, does not need to any adaptation data and adapts the model online. Experiments on FARSDAT dataset demonstrate that these methods improve phone recognition accuracy rate by 2% and 6%.
  • Keywords
    feature extraction; neural nets; speech recognition; vectors; FARSDAT dataset; acoustic model parameters; deep neural network based speech recognition systems; feature vectors; forward-backward iterations; input features; phone recognition accuracy rate; speaker adaptation methods; speaker variabilities; Acoustics; Adaptation models; Hidden Markov models; Neurons; Speech; Speech recognition; Training; deep neural networks; nonlinear normalization; speaker adaptation; speaker recognition; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications (IST), 2014 7th International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4799-5358-5
  • Type

    conf

  • DOI
    10.1109/ISTEL.2014.7000746
  • Filename
    7000746