Multi-basis adaptive neural network for rapid adaptation in speech recognition

Author

Chunyang Wu ; Gales, Mark J. F.

Author_Institution

Eng. Dept., Cambridge Univ., Cambridge, UK

fYear

2015

fDate

19-24 April 2015

Firstpage

4315

Lastpage

4319

Abstract

Recent progress in acoustic modeling with deep neural network has significantly improved the performance of automatic speech recognition systems. However, it remains as an open problem how to rapidly adapt these networks with limited, unsupervised, data. Most existing methods to adapt a neural network involve modifying a large number of parameters thus rapid adaptation is not possible with these schemes. In this paper, the multi-basis adaptive neural network is proposed, a new neural network configuration which only requires very few parameters for adaptation. By modifying the topology of a single multi-layer perception, a set of sub-networks with restricted connectivity are introduced to collaboratively capture different acoustic properties. The outputs of those sub-networks are combined by speaker-dependent interpolation weights. In addition, the complete system can be optimized in an adaptive training fashion when non-homogeneous training data are used. The performance of unsupervised adaptation is evaluated on two datasets. It outperforms the speaker-independent hybrid DNN-HMM baseline both on the Broadcast News English and the AURORA-4 tasks.

Keywords

acoustic signal processing; adaptive signal processing; interpolation; multilayer perceptrons; speech recognition; acoustic modeling; automatic speech recognition systems; deep neural network; multibasis adaptive neural network; multilayer perception; neural network configuration; nonhomogeneous training data; speaker dependent interpolation weight; speaker-independent hybrid DNN-HMM; Acoustics; Adaptation models; Hidden Markov models; Neural networks; Silicon; Speech; Training; Adaptation; deep neural network; speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7178785

Filename

7178785