Conference record number
2139
Paper title
Preparing an accurate Persian POS tagger suitable for MT
Title in another language
Preparing an accurate Persian POS tagger suitable for MT
Authors
Shakeri Zakieh (author), Riahi Noushin (author), Khadivi Shahram (author)
Number of pages
4
Keywords
POS tag, Persian POS, MT, SMT
Publication year
1391
Conference title
First International Conference on Persian Script and Language Processing
Document language
Persian
Persian abstract
This paper presents an accurate Persian part-of-speech (POS) tagger suitable for machine translation (MT). First, a new set of POS tags is defined that is general, and therefore more usable for MT than a detailed tag set. Then an accurately tagged corpus is prepared by modifying the Bijankhan corpus. The Stanford POS tagger is trained on the modified Bijankhan corpus; the resulting tagger reaches 99.36% accuracy, a significant improvement over previous Persian taggers. The effect of using this tagger in statistical machine translation (SMT) is also investigated. The outputs show better performance than simple SMT, whereas using a previous tagger in SMT lowers the BLEU score relative to simple SMT.
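The abstract describes two steps that a small example can make concrete: collapsing detailed tags into a general tag set, and scoring a tagger by token-level accuracy. The following is a minimal sketch only. The mapping `COARSE_TAG`, the tag labels in it, and the helper names `coarsen` and `token_accuracy` are illustrative assumptions and are not taken from the paper, whose actual tag set is not given in this record.

```python
from typing import Dict, List

# Hypothetical fine-to-coarse mapping, for illustration only.
# The paper's real tag set is not listed in this record.
COARSE_TAG: Dict[str, str] = {
    "N_SING": "N",
    "N_PL": "N",
    "V_PA": "V",
    "V_PRS": "V",
    "ADJ_SIM": "ADJ",
}


def coarsen(tag: str) -> str:
    """Map a detailed tag to a general one; unknown tags pass through unchanged."""
    return COARSE_TAG.get(tag, tag)


def token_accuracy(gold: List[str], predicted: List[str]) -> float:
    """Fraction of tokens whose predicted tag equals the gold tag."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("cannot score an empty sequence")
    correct = sum(g == p for g, p in zip(gold, predicted))
    return correct / len(gold)


if __name__ == "__main__":
    gold = ["N_SING", "V_PA", "ADJ_SIM", "N_PL"]
    pred = ["N_PL", "V_PA", "ADJ_SIM", "V_PRS"]
    # Accuracy on the detailed tags, then on the collapsed tags.
    print(token_accuracy(gold, pred))
    print(token_accuracy([coarsen(t) for t in gold],
                         [coarsen(t) for t in pred]))
```

In this toy input, one of the two errors is a confusion between two detailed noun tags, so it disappears once the tags are collapsed. This illustrates why accuracy figures measured on tag sets of different granularity are not directly comparable.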
Conference document number
4474716
From page
1
To page
4