Title :
Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task
Author :
Stuker, Sebastian ; Muller, Mathias ; Quoc Bao Nguyen ; Waibel, Alex
Author_Institution :
Inst. for Anthropomatics, Karlsruhe Inst. of Technol., Karlsruhe, Germany
Abstract :
In the IARPA sponsored program BABEL we are faced with the challenge of training automatic speech recognition systems in sparse data conditions in very little time. In this paper we show that by using multilingual bootstrapping techniques in combination with multilingual deep belief bottle neck features that are only fine tuned on the target language the training time of an LVCSR system can be essentially halved while the word error rate stays the same. We show this for recognition systems on Tagalog, making use of multilingual systems trained on the other four languages of the Babel base period: Cantonese, Pashto, Turkish, and Vietnamese.
Keywords :
computational linguistics; error statistics; speech recognition; statistical analysis; training; vocabulary; BABEL ASR; Babel base period; Cantonese; IARPA; LVCSR system; Pashto; Tagalog; Turkish; Vietnamese; automatic speech recognition; multilingual bootstrapping technique; multilingual deep belief bottle neck feature; multilingual systems; performance improvement; sparse data conditions; training time reduction; word error rate; Acoustics; Context modeling; Data models; Hidden Markov models; Speech recognition; Training; Training data; automatic speech recognition; multilingual speech recognition; rapid system development;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854831