Thematic Similarity Multiple-Choice Question Answering with Doc2Vec: A Step Toward Metaphorical Language Processing

Akef, Soroosh; Bokaei, Mohammad Hadi; Sameti, Hossein

Volume 12, Issue 2 (6-2020) itrc 2020, 12(2): 46-53

Akef S, Bokaei M H, Sameti H. Thematic Similarity Multiple-Choice Question Answering with Doc2Vec: A Step Toward Metaphorical Language Processing. itrc 2020; 12(2): 46-53
URL: http://journal.itrc.ac.ir/article-1-459-en.html

Soroosh Akef¹, Mohammad Hadi Bokaei², Hossein Sameti³

1- Languages and Linguistics Center, Sharif University of Technology, Tehran, Iran
2- Department of Information Technology, Iran Telecommunication Research Center, Tehran, Iran, mh.bokaei@itrc.ac.ir
3- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran

Abstract:

This paper reports our improvement over the previous benchmark on the task of answering thematic similarity multiple-choice questions (MCQs) about poetic verses. In this experiment, we trained a Doc2Vec model on a corpus of Persian poems and used the trained model to obtain vector representations of the poetic verses. The model then selected as the correct answer the option verse with the highest cosine similarity to the stem verse. This model answered 38% of the questions correctly, an improvement of 6% over the previous benchmark. Provided that a large-scale thematic similarity MCQ dataset is developed, the performance of a language representation model on this task could serve as a novel benchmark for measuring a model's capacity to understand metaphorical language.
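The answer-selection step described in the abstract can be illustrated with a minimal sketch. It assumes the stem and option verses have already been mapped to vectors (for instance by a trained Doc2Vec model, such as via gensim's `infer_vector`); the function names `cosine_similarity` and `pick_answer` and the toy vectors are hypothetical and are not taken from the paper.

```python
import math
from typing import Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between u and v; returns 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def pick_answer(stem_vec: Sequence[float],
                option_vecs: Sequence[Sequence[float]]) -> int:
    """Return the index of the option most similar to the stem verse."""
    scores = [cosine_similarity(stem_vec, opt) for opt in option_vecs]
    return max(range(len(scores)), key=scores.__getitem__)


# Toy vectors standing in for verse embeddings (illustrative values only).
stem = [1.0, 0.0, 1.0]
options = [
    [0.0, 1.0, 0.0],    # orthogonal to the stem
    [2.0, 0.1, 1.9],    # nearly parallel to the stem
    [-1.0, 0.0, -1.0],  # opposite direction
    [1.0, 1.0, 0.0],    # partially aligned
]
best = pick_answer(stem, options)
print(best)
```

Because cosine similarity ignores vector length, the second option is chosen even though its magnitude differs from the stem's; only direction matters for the comparison.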

Keywords: Doc2Vec, MCQ answering, computational linguistics, poetry, figurative speech, digital humanities.

Type of Study: Research | Subject: Information Technology

Rights and permissions
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.