• DocumentCode
    730672
  • Title

    Multi-basis adaptive neural network for rapid adaptation in speech recognition

  • Author

    Chunyang Wu ; Gales, Mark J. F.

  • Author_Institution
    Eng. Dept., Cambridge Univ., Cambridge, UK
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4315
  • Lastpage
    4319
  • Abstract
    Recent progress in acoustic modeling with deep neural network has significantly improved the performance of automatic speech recognition systems. However, it remains as an open problem how to rapidly adapt these networks with limited, unsupervised, data. Most existing methods to adapt a neural network involve modifying a large number of parameters thus rapid adaptation is not possible with these schemes. In this paper, the multi-basis adaptive neural network is proposed, a new neural network configuration which only requires very few parameters for adaptation. By modifying the topology of a single multi-layer perception, a set of sub-networks with restricted connectivity are introduced to collaboratively capture different acoustic properties. The outputs of those sub-networks are combined by speaker-dependent interpolation weights. In addition, the complete system can be optimized in an adaptive training fashion when non-homogeneous training data are used. The performance of unsupervised adaptation is evaluated on two datasets. It outperforms the speaker-independent hybrid DNN-HMM baseline both on the Broadcast News English and the AURORA-4 tasks.
  • Keywords
    acoustic signal processing; adaptive signal processing; interpolation; multilayer perceptrons; speech recognition; acoustic modeling; automatic speech recognition systems; deep neural network; multibasis adaptive neural network; multilayer perception; neural network configuration; nonhomogeneous training data; speaker dependent interpolation weight; speaker-independent hybrid DNN-HMM; Acoustics; Adaptation models; Hidden Markov models; Neural networks; Silicon; Speech; Training; Adaptation; deep neural network; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178785
  • Filename
    7178785