DocumentCode
1687731
Title
Multi-level adaptive networks in tandem and hybrid ASR systems
Author
Bell, P. ; Swietojanski, Pawel ; Renals, Steve
Author_Institution
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
fYear
2013
Firstpage
6975
Lastpage
6979
Abstract
In this paper we investigate the use of Multi-level adaptive networks (MLAN) to incorporate out-of-domain data when training large vocabulary speech recognition systems. In a set of experiments on multi-genre broadcast data and on TED lecture recordings we present results using of out-of-domain features in a hybrid DNN system and explore tandem systems using a variety of input acoustic features. Our experiments indicate using the MLAN approach in both hybrid and tandem systems results in consistent reductions in word error rate of 5-10% relative.
Keywords
error analysis; speech recognition; vocabulary; TED lecture recordings; hybrid ASR systems; multigenre broadcast data; multilevel adaptive networks; out-of-domain data; vocabulary speech recognition systems; word error rate; Acoustics; Adaptation models; Hidden Markov models; Neural networks; Speech; Speech recognition; Training; BBC; MLAN; TED; deep neural networks; hybrid; tandem;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639014
Filename
6639014
Link To Document