Title :
Strategies for using MLP based features with limited target-language training data
Author :
Qian, Yanmin ; Xu, Ji ; Povey, Daniel ; Liu, Jia
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
Recently there has been some interest in the question of how to build LVCSR systems when there is only a limited amount of acoustic training data in the target language, but possibly more plentiful data in other languages. In this paper we investigate approaches using MLP based features. We experiment with two approaches: One is based on Automatic Speech Attribute Transcription (ASAT), in which we train classifiers to learn articulatory features. The other approach uses only the target-language data and relies on combination of multiple MLPs trained on different subsets. After system combination we get large improvements of more than 10% relative versus a conventional baseline. These feature-level approaches may also be combined with other, model-level methods for the multilingual or low-resource scenario.
Keywords :
speech recognition; ASAT; LVCSR systems; MLP based features; acoustic training data; automatic speech attribute transcription; conventional baseline; feature-level approaches; low-resource scenario; model-level methods; multilingual scenario; target-language training data; Acoustics; Detectors; Feature extraction; Hidden Markov models; Speech; Training; Training data; Articulatory features; Low resource ASR; Multi-Layer Perceptrons; Tandem features;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
DOI :
10.1109/ASRU.2011.6163957