• Conference record number
    2139
  • Article title

    Preparing an accurate Persian POS tagger suitable for MT

  • Title in another language
    Preparing an accurate Persian POS tagger suitable for MT
  • Authors

    Shakeri Zakieh (author), Riahi Noushin (author), Khadivi Shahram (author)

  • Number of pages
    4
  • Keywords
    POS tag, Persian POS, MT, SMT
  • Publication year
    1391
  • Conference title
    First International Conference on Persian Script and Language Processing
  • Document language
    Persian
  • Persian abstract
    This paper presents an accurate Persian POS tagger suitable for MT. First, a new set of POS tags is defined that is general and more usable for MT than detailed tag sets. Then an accurately tagged corpus is prepared by modifying the Bijankhan corpus. The Stanford POS tagger is trained on the modified Bijankhan corpus; the resulting tagger achieves 99.36% accuracy, a significant improvement over previous Persian taggers. The effect of using this tagger in statistical machine translation (SMT) is then investigated. The outputs show better performance than simple SMT, whereas using the previous tagger in SMT lowers the BLEU score compared to simple SMT.
  • Conference document number
    4474716
  • From page
    1
  • To page
    4