شماره ركورد
440819
عنوان مقاله
The Role of large Monolingual Corpora in Improving MachineTranslation Quality;
عنوان به زبان ديگر
The Role of Large Monolingual Corpora in Improving Machine
Translation Qualityʹ
پديد آورندگان
Mosavi Miangah، Tayebeh نويسنده دانشگاه فردوسي مشهد; ,
اطلاعات موجودي
فصلنامه سال 1389 شماره 29
رتبه نشريه
علمي پژوهشي
تعداد صفحه
17
از صفحه
52
تا صفحه
68
كليدواژه
monolingual corpus , Machine translation , Persian language , collocations , Lexical ambiguity
چكيده لاتين
Collocations, recurrent combinations of words whose co-occurrence probability is higher than chance, are frequent in natural languages. Since bilingual dictionaries do not offer proper equivalents for most of such collocations, the majority of machine translation systems perform poorly when faced with collocations, and as a result, their output quality decreases to a high degree. Monolingual corpora have recently been used in solving various linguistic problems including natural language processing, statistical machine translation, language teaching and the like.
The present study describes the process of creating and exploiting a large monolingual corpus of Persian. This corpus enables us to solve the ambiguity problem of English collocations when translating into Persian with an English - Persian machine translation system. Using such a corpus as a target language corpus and an English-Persian bilingual dictionary, we study the efficiency of this corpus in finding the most appropriate Persian equivalents for English collocations in order to enhance the output quality of a machine translation system. The results of the experiment on a test corpus were very encouraging and achieved 90.83% success.
سال انتشار
1389
عنوان نشريه
مطالعات ترجمه
عنوان نشريه
مطالعات ترجمه
اطلاعات موجودي
فصلنامه با شماره پیاپی 29 سال 1389
كلمات كليدي
#تست#آزمون###امتحان
لينک به اين مدرک