• DocumentCode
    131351
  • Title

    Improvement of an abstractive summarization evaluation tool using lexical-semantic relations and weighted syntax tags in Farsi language

  • Author

    Estiri, Ahmad ; Kahani, Mohsen ; Ghaemi, Hirad ; Abasi, Mohsen

  • Author_Institution
    Web Technol. Lab., Ferdowsi Univ. of Mashhad, Mashhad, Iran
  • fYear
    2014
  • fDate
    4-6 Feb. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In recent years, the sharp increase in the amount of published web elements and the need to store, classify, restore, and process them have intensified the importance of natural language processing and related tools such as automatic summarizers and machine translators. This paper proposes a novel approach for evaluating automatic abstractive summarization systems, which can also be used in other natural language processing and information retrieval applications. By comparing auto-abstracts (abstracts created by machine) with human abstracts (ideal abstracts created by humans), the metrics introduced in the proposed tool automatically measure the quality of auto-abstracts. Abstractive summaries cannot be compared semantically by looking only at the surface form of their words, so a lexical database such as WordNet is necessary. We use FerdowsNet with an approach suited to the Farsi language, together with a Farsi parser to identify the groups that form sentences, and this notably improves the evaluation results. The tool has been assessed by linguistic experts.
  • Keywords
    database management systems; information retrieval; language translation; natural language processing; Farsi language; Web elements; WordNet; abstractive summaries; abstractive summarization evaluation tool; automatic abstractive summarization system; human abstracts; information retrieval applications; lexical database; lexical semantic relations; linguistic experts; machine translators; natural language processing; weighted syntax tags; Abstracts; Databases; Equations; Measurement; Natural language processing; Semantics; Standards; Automatic Abstractive Summarizer; Evaluation; Farsi Natural Language Processing (NLP); Parse tree; Semantics; Sentences groups; parser;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Systems (ICIS), 2014 Iranian Conference on
  • Conference_Location
    Bam
  • Print_ISBN
    978-1-4799-3350-1
  • Type

    conf

  • DOI
    10.1109/IranianCIS.2014.6802594
  • Filename
    6802594