XRJL-HKUST At Semeval-2021 Task 4: Wordnet-enhanced Dual Multi-head Co-attention For Reading Comprehension Of Abstract Meaning

Jiang Yuxin, Shou Ziyi, Wang Qijun, Wu Hao, Lin Fangzhen. Arxiv 2021

[Paper]
Attention Mechanism Model Architecture

This paper presents our submitted system to SemEval 2021 Task 4: Reading Comprehension of Abstract Meaning. Our system uses a large pre-trained language model as the encoder and an additional dual multi-head co-attention layer to strengthen the relationship between passages and question-answer pairs, following the current state-of-the-art model DUMA. The main difference is that we stack the passage-question and question-passage attention modules instead of calculating parallelly to simulate re-considering process. We also add a layer normalization module to improve the performance of our model. Furthermore, to incorporate our known knowledge about abstract concepts, we retrieve the definitions of candidate answers from WordNet and feed them to the model as extra inputs. Our system, called WordNet-enhanced DUal Multi-head Co-Attention (WN-DUMA), achieves 86.67% and 89.99% accuracy on the official blind test set of subtask 1 and subtask 2 respectively.

The Large Language Model Bible

XRJL-HKUST At Semeval-2021 Task 4: Wordnet-enhanced Dual Multi-head Co-attention For Reading Comprehension Of Abstract Meaning

Jiang Yuxin, Shou Ziyi, Wang Qijun, Wu Hao, Lin Fangzhen. Arxiv 2021

Similar Work