• DocumentCode
    1830855
  • Title
    Rank correlation analysis of NTCIR-10 RITE-2 Chinese datasets and evaluation metrics
  • Author
    Chuan-Jie Lin ; Cheng-Wei Lee ; Cheng-Wei Shih ; Wen-Lian Hsu
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nat. Taiwan Ocean Univ., Keelung, Taiwan
  • fYear
    2013
  • fDate
    14-16 Aug. 2013
  • Firstpage
    62
  • Lastpage
    68
  • Abstract
    Textual Entailment (TE) is the task of recognizing entailment, paraphrase, and contradiction relations within a given text pair. The goal of textual entailment research is to develop a core inference component that can be applied to various domains such as question answering (QA). We examined several rank correlations over the data and system results of the NTCIR-10 RITE-2 task to identify relationships between datasets and evaluation metrics. We also constructed RITE4QA datasets in the RITE-2 task under a QA scenario to assess the applicability of RITE systems to QA. We find that datasets created from different sources and in different ways can hardly predict each other. However, the system ranking on the dataset consisting of expert-made artificial pairs has moderate correlation with the ranking on QA metrics. Both RITE metrics and QA metrics are stable within their own subtasks.
  • Keywords
    text analysis; NTCIR-10 RITE-2 Chinese datasets; NTCIR-10 RITE-2 task; QA metrics; RITE metrics; RITE systems; RITE4QA datasets; contradiction relations; core inference component; entailment recognition; evaluation metrics; expert-made artificial pairs; paraphrase recognition; rank correlation analysis; rank correlations; system ranking; text pair; textual entailment research; Accuracy; Correlation; Encyclopedias; Knowledge discovery; Measurement; Text recognition; Artificial Pairs; RITE; Rank Correlation; System Ranking Estimation Metrics; Textual Entailment;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Reuse and Integration (IRI), 2013 IEEE 14th International Conference on
  • Conference_Location
    San Francisco, CA
  • Type
    conf
  • DOI
    10.1109/IRI.2013.6642454
  • Filename
    6642454