DocumentCode :
3485678
Title :
Strategies for using MLP based features with limited target-language training data
Author :
Qian, Yanmin ; Xu, Ji ; Povey, Daniel ; Liu, Jia
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
354
Lastpage :
358
Abstract :
Recently there has been some interest in the question of how to build LVCSR systems when there is only a limited amount of acoustic training data in the target language, but possibly more plentiful data in other languages. In this paper we investigate approaches using MLP based features. We experiment with two approaches: One is based on Automatic Speech Attribute Transcription (ASAT), in which we train classifiers to learn articulatory features. The other approach uses only the target-language data and relies on combination of multiple MLPs trained on different subsets. After system combination we get large improvements of more than 10% relative versus a conventional baseline. These feature-level approaches may also be combined with other, model-level methods for the multilingual or low-resource scenario.
Keywords :
speech recognition; ASAT; LVCSR systems; MLP based features; acoustic training data; automatic speech attribute transcription; conventional baseline; feature-level approaches; low-resource scenario; model-level methods; multilingual scenario; target-language training data; Acoustics; Detectors; Feature extraction; Hidden Markov models; Speech; Training; Training data; Articulatory features; Low resource ASR; Multi-Layer Perceptrons; Tandem features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163957
Filename :
6163957
Link To Document :
بازگشت