X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering · The Large Language Model Bible Contribute to LLM-Bible

X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering

M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan. Arxiv 2021

[Paper]    
Applications Attention Mechanism BERT Efficiency And Optimization Fine Tuning Model Architecture Pretraining Methods RAG Reinforcement Learning Tools Training Techniques

Multilingual models, such as M-BERT and XLM-R, have gained increasing popularity, due to their zero-shot cross-lingual transfer learning capabilities. However, their generalization ability is still inconsistent for typologically diverse languages and across different benchmarks. Recently, meta-learning has garnered attention as a promising technique for enhancing transfer learning under low-resource scenarios: particularly for cross-lingual transfer in Natural Language Understanding (NLU). In this work, we propose X-METRA-ADA, a cross-lingual MEta-TRAnsfer learning ADAptation approach for NLU. Our approach adapts MAML, an optimization-based meta-learning approach, to learn to adapt to new languages. We extensively evaluate our framework on two challenging cross-lingual NLU tasks: multilingual task-oriented dialog and typologically diverse question answering. We show that our approach outperforms naive fine-tuning, reaching competitive performance on both tasks for most languages. Our analysis reveals that X-METRA-ADA can leverage limited data for faster adaptation.

Similar Work