Do Gpts Produce Less Literal Translations?

Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan. Arxiv 2023

[Paper]
Applications Few Shot GPT In Context Learning Model Architecture Prompting

Large Language Models (LLMs) such as GPT-3 have emerged as general-purpose language models capable of addressing many natural language generation or understanding tasks. On the task of Machine Translation (MT), multiple works have investigated few-shot prompting mechanisms to elicit better translations from LLMs. However, there has been relatively little investigation on how such translations differ qualitatively from the translations generated by standard Neural Machine Translation (NMT) models. In this work, we investigate these differences in terms of the literalness of translations produced by the two systems. Using literalness measures involving word alignment and monotonicity, we find that translations out of English (E-X) from GPTs tend to be less literal, while exhibiting similar or better scores on MT quality metrics. We demonstrate that this finding is borne out in human evaluations as well. We then show that these differences are especially pronounced when translating sentences that contain idiomatic expressions.

The Large Language Model Bible

Do Gpts Produce Less Literal Translations?

Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan. Arxiv 2023

Similar Work