مرکز منطقه ای اطلاع رساني علوم و فناوري - Deep neural network features and semi-supervised training for low resource speech recognition

DocumentCode :

1686243

Title :

Deep neural network features and semi-supervised training for low resource speech recognition

Author :

Thomas, Stephan ; Seltzer, Michael L. ; Church, Kenneth ; Hermansky, Hynek

Author_Institution :

Johns Hopkins Univ., Baltimore, MD, USA

fYear :

2013

Firstpage :

6704

Lastpage :

6708

Abstract :

We propose a new technique for training deep neural networks (DNNs) as data-driven feature front-ends for large vocabulary continuous speech recognition (LVCSR) in low resource settings. To circumvent the lack of sufficient training data for acoustic modeling in these scenarios, we use transcribed multilingual data and semi-supervised training to build the proposed feature front-ends. In our experiments, the proposed features provide an absolute improvement of 16% in a low-resource LVCSR setting with only one hour of in-domain training data. While close to three-fourths of these gains come from DNN-based features, the remaining are from semi-supervised training.

Keywords :

neural nets; speech recognition; DNN-based features; acoustic modeling; data-driven feature front-ends; deep neural network features; in-domain training data; large vocabulary continuous speech recognition; low resource speech recognition; low-resource LVCSR setting; resource settings; semisupervised training; training data; transcribed multilingual data; Acoustics; Data models; Neural networks; Speech; Speech recognition; Training; Training data; Low resource; bottleneck features; deep neural networks; semi-supervised training; speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6638959

Filename :

6638959

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1686243