Title :
Tandem system adaptation using multiple linear feature transforms
Author :
Wang, Y.-Q. ; Gales, Mark J.F.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
Abstract :
Adaptation to speaker and environment changes is an essential part of current automatic speech recognition (ASR) systems. In recent years the use of multi-layer percpetrons (MLPs) has become increasingly common in ASR systems. A standard approach to handling speaker differences when using MLPs is to apply a global speaker-specific constrained MLLR (CMLLR) transform to the features prior to training or using the MLP. This paper considers the situation when there are both speaker and channel, communication link, differences in the data. A more powerful transform, front-end CMLLR (FE-CMLLR), is applied to the inputs to the MLP to represent the channel differences. Though global, these FE-CMLLR transforms vary from time-instance to time-instance. Experiments on a channel distorted dialect Arabic conversational speech recognition task indicates the usefulness of adapting MLP features using both CMLLR and FE-CMLLR transforms.
Keywords :
multilayer perceptrons; speaker recognition; transforms; ASR systems; FE-CMLLR transforms; MLPs; automatic speech recognition systems; channel distorted dialect Arabic conversational speech recognition task; environment changes; front-end CMLLR; global speaker-specific constrained MLLR; multilayer percpetrons; multiple linear feature transforms; tandem system adaptation; Acoustic distortion; Adaptation models; Neural networks; Silicon; Speech; Speech recognition; Transforms; MLP feature; acoustic model adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639209