Title :
Unsupervised training for farsi-english speech-to-speech translation
Author :
Xiang, Bing ; Deng, Yonggang ; Gao, Yuqing
Author_Institution :
T. J. Watson Res. Center, IBM, Yorktown Heights, NY
fDate :
March 31 2008-April 4 2008
Abstract :
Speech-to-speech translation has evolved into an attractive area in recent years with significant progress made by various research groups. However, the translation engines usually suffer from the lack of bilingual training data, especially for low-resource languages. In this paper we present an unsupervised training technique to alleviate this problem by taking advantage of available source language data. Different approaches are proposed and compared through extensive experiments conducted on a speech-to-speech translation task between Farsi and English. The translation performance is significantly improved in both directions with the enhanced translation model. A state-of-the-art Farsi automatic speech recognition system is also established in this work.
Keywords :
language translation; natural language processing; speech processing; speech recognition; unsupervised learning; Farsi-English speech-to-speech translation; automatic speech recognition; bilingual training data; source language data; statistical machine translation; unsupervised training; Automatic speech recognition; Engines; Frequency estimation; Hidden Markov models; Iterative decoding; Natural languages; Probability; Speech recognition; Testing; Training data; Machine Translation; Speech Recognition; Unsupervised Training;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518775