DocumentCode :
179602
Title :
Multilingual shifting deep bottleneck features for low-resource ASR
Author :
Quoc Bao Nguyen ; Gehring, Jonas ; Muller, Mathias ; Stuker, Sebastian ; Waibel, Alex
Author_Institution :
Karlsruhe Inst. of Technol., Karlsruhe, Germany
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
5607
Lastpage :
5611
Abstract :
In this work, we propose a deep bottleneck feature architecture that is able to leverage data from multiple languages. We also show that tonal features are helpful for non-tonal languages. Evaluations are performed on a low-resource conversational telephone speech transcription task in Bengali, while additional data for DBNF training is provided in Assamese, Pashto, Tagalog, Turkish, and Vietnamese. We obtain relative reductions of up to 17.3% and 9.4% WER over mono-lingual GMMs and DBNFs, respectively.
Keywords :
Gaussian processes; feature extraction; mixture models; natural language processing; speech recognition; Assamese; Bengali; DBNF training; Pashto; Tagalog; Turkish; Vietnamese; WER; low-resource ASR; low-resource conversational telephone speech transcription task; monolingual GMM; multilingual shifting deep bottleneck features; nontonal languages; tonal features; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech recognition; Standards; Training; Deep Neural Networks; Low-Resource ASR; Multilingual Deep bottleneck features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854676
Filename :
6854676
Link To Document :
بازگشت