DocumentCode :
3317742
Title :
A Web-based unsupervised algorithm for learning transliteration model to improve translation of low-frequency proper names
Author :
Shia, Min-Shiang ; Lin, Jiun-Hung ; Yu, Scott ; Lu, Wen-Hsiang
Author_Institution :
Nat. Cheng Kung Univ., Tainan, Taiwan
fYear :
2005
fDate :
30 Oct.-1 Nov. 2005
Firstpage :
420
Lastpage :
425
Abstract :
In machine translation, cross-language information retrieval, and cross-language question answering, the problems of unknown term translation are difficult to be solved. Although we have proposed several effective Web-based term translation extraction methods exploring Web resources to deal with translation of frequent Web query terms. However, many low-frequency unknown terms are still difficult to be translated by using our previous Web-based term translation extraction methods. Therefore, in this paper we propose a two-stage hybrid translation extraction method, which is composed of our pervious Web-based term translation extraction method and a new Web-based transliteration method to improve translation of low-frequency unknown proper names. Additionally, to construct a good quality transliteration model, we also present a Web-based unsupervised learning algorithm to automatically collect diverse English-Chinese transliteration pairs from the Web. Experimental results showed that our new method can make great improvements for translation of unknown proper names.
Keywords :
Internet; information retrieval; language translation; natural languages; unsupervised learning; English-Chinese transliteration pair; Web-based term translation extraction method; Web-based unsupervised algorithm; cross-language information retrieval; cross-language question answering; learning transliteration model; low-frequency proper names; machine translation; Data mining; Dictionaries; Information retrieval; Machine learning; Natural languages; Training data; Unsupervised learning; Web pages; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
Type :
conf
DOI :
10.1109/NLPKE.2005.1598774
Filename :
1598774
Link To Document :
بازگشت