Title :
Constrained discriminative mapping transforms for unsupervised speaker adaptation
Author :
Chen, Langzhou ; Gales, Mark J F ; Chin, K.K.
Author_Institution :
Cambridge Res. Lab., Toshiba Res. Eur. Ltd., Cambridge, UK
Abstract :
Discriminative mapping transforms (DMTs) is an approach to robustly adding discriminative training to unsupervised linear adaptation transforms. In unsupervised adaptation DMTs are more robust to unreliable transcriptions than directly estimating adaptation transforms in a discriminative fashion. They were previously pro posed for use with MLLR transforms with the associated need to explicitly transform the model parameters. In this work the DMT is extended to CMLLR transforms. As these operate in the feature space, it is only necessary to apply a different linear transform at the front-end rather than modifying the model parameters. This is useful for rapidly changing speakers/environments. The performance of DMTs with CMLLR was evaluated on the WSJ 20k task. Experimental results show that DMTs based on constrained linear trans forms yield 3% to 6% relative gain over MLE transforms in unsupervised speaker adaptation.
Keywords :
speech recognition; transforms; ASR; CMLLR transform; MLE transform; WSJ 20k task; constrained DMT approach; constrained discriminative mapping transform approach; discriminative training; unsupervised linear adaptation transform; unsupervised speaker adaptation; Adaptation models; Equations; Maximum likelihood estimation; Smoothing methods; Training; Transforms; CMLLR; Discriminative Linear Transforms; Discriminative Mapping Transforms; Discriminative Training; Mininum Phone Error; Speaker Adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947565