Title :
Multi-level adaptive networks in tandem and hybrid ASR systems
Author :
Bell, P. ; Swietojanski, Pawel ; Renals, Steve
Author_Institution :
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
Abstract :
In this paper we investigate the use of Multi-level adaptive networks (MLAN) to incorporate out-of-domain data when training large vocabulary speech recognition systems. In a set of experiments on multi-genre broadcast data and on TED lecture recordings we present results using of out-of-domain features in a hybrid DNN system and explore tandem systems using a variety of input acoustic features. Our experiments indicate using the MLAN approach in both hybrid and tandem systems results in consistent reductions in word error rate of 5-10% relative.
Keywords :
error analysis; speech recognition; vocabulary; TED lecture recordings; hybrid ASR systems; multigenre broadcast data; multilevel adaptive networks; out-of-domain data; vocabulary speech recognition systems; word error rate; Acoustics; Adaptation models; Hidden Markov models; Neural networks; Speech; Speech recognition; Training; BBC; MLAN; TED; deep neural networks; hybrid; tandem;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639014