Title :
Cross-language transfer speech recognition using deep learning
Author :
Yue Zhao ; Xu, Yan M. ; Sun, Mei J. ; Xu, Xiao N. ; Hui Wang ; Yang, Guo S. ; Qiang Ji
Author_Institution :
Dept. of Autom., Minzu Univ. of China, Beijing, China
Abstract :
Cross-language transfer speech recognition aims to transform phoneme models for a source language to recognize a target language lacking labeled data and other linguistic resources. In this paper, sparse auto-encoder, a deep learning method, is introduced to derive shared speech features between source and target language using semi-supervised learning. It can extract the shared representation of phonemes between the source and target languages so that the target phones can be mapped to the appropriate phones of the source languages. The experimental results showed this method performs better on cross-language phones recognition than the method based on multilayer perceptron.
Keywords :
feature extraction; learning (artificial intelligence); natural language processing; sparse matrices; speech recognition; cross-language phone recognition; cross-language transfer speech recognition; deep learning method; labeled data; linguistic resources; phoneme models; semisupervised learning; shared phoneme representation extraction; source language; source language phones; sparse autoencoder; speech features; target phone mapping; Data models; Encyclopedias; Feature extraction; Hidden Markov models; Speech; Speech recognition; Target recognition;
Conference_Titel :
Control & Automation (ICCA), 11th IEEE International Conference on
Conference_Location :
Taichung
DOI :
10.1109/ICCA.2014.6871132