Title of article :
Thematic Similarity Multiple-Choice Question Answering with Doc2Vec: A Step Toward Metaphorical Language Processing
Author/Authors :
Akef, Soroosh Languages and Linguistics - Center Sharif University of Technology Tehran, Iran , Bokaei, Mohammad Hadi Department of Information Technology Iran Telecommunication - Research Center Tehran, Iran , Sameti, Hossein Department of Computer Engineering - Sharif University of Technology Tehran, Iran
Pages :
8
From page :
46
To page :
53
Abstract :
This paper reports our improvement over the previous benchmark of the task of answering poetic verses' thematic similarity multiple-choice questions (MCQs). In this experiment, we have trained a Doc2Vec model on a corpus of Persian poems and proceeded to use the trained model to get the vector representations of the poetic verses. Subsequently, the poetic verse among the options with the highest cosine similarity to the stem verse was selected as the correct answer by the model. This model managed to answer 38% of the questions correctly, which was an improvement of 6% over the previous benchmark. Provided that a large-scale thematic similarity MCQ dataset is developed, the performance of a language representation model on this task could be considered as a novel benchmark to measure the capacity of a model to understand metaphorical language.
Keywords :
digital humanities , figurative speech , poetry , computational linguistics , MCQ answering , Doc2Vec
Journal title :
International Journal of Information and Communication Technology Research
Serial Year :
2020
Record number :
2629213
Link To Document :
بازگشت