EDITOR: An Edit-based Transformer With Repositioning For Neural Machine Translation With Soft Lexical Constraints

Weijia Xu, Marine Carpuat. arXiv 2020

[Paper]    
Tags: Applications, GPT, Model Architecture, Pretraining Methods, Transformer

We introduce an Edit-Based Transformer with Repositioning (EDITOR), which makes sequence generation flexible by seamlessly allowing users to specify preferences in output lexical choice. Building on recent models for non-autoregressive sequence generation (Gu et al., 2019), EDITOR generates new sequences by iteratively editing hypotheses. It relies on a novel reposition operation designed to disentangle lexical choice from word positioning decisions, while enabling efficient oracles for imitation learning and parallel edits at decoding time. Empirically, EDITOR uses soft lexical constraints more effectively than the Levenshtein Transformer (Gu et al., 2019) while speeding up decoding dramatically compared to constrained beam search (Post and Vilar, 2018). EDITOR also achieves comparable or better translation quality with faster decoding speed than the Levenshtein Transformer on standard Romanian-English, English-German, and English-Japanese machine translation tasks.
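The abstract describes decoding as iterative, parallel editing of a hypothesis that is seeded with the user's soft lexical constraints, using a reposition operation alongside placeholder insertion and token prediction. The sketch below illustrates that decoding loop at a high level, assuming hypothetical callables `predict_reposition`, `predict_placeholders`, and `predict_tokens` standing in for the model's edit heads; it is not the authors' implementation, only a minimal illustration of the control flow.

```python
from typing import Callable, List


def editor_decode(
    source: List[str],
    constraints: List[str],
    predict_reposition: Callable[[List[str], List[str]], List[int]],
    predict_placeholders: Callable[[List[str], List[str]], List[int]],
    predict_tokens: Callable[[List[str], List[str]], List[str]],
    max_iters: int = 10,
) -> List[str]:
    """Sketch of EDITOR-style iterative refinement (hypothetical interfaces).

    Each iteration applies, in parallel across positions:
      1. reposition: move or drop existing tokens (subsumes deletion),
         disentangling word-order decisions from lexical choice;
      2. placeholder insertion: open new slots in the gaps between tokens
         (predict_placeholders is assumed to return len(hypothesis)+1 counts);
      3. token prediction: fill every open slot with a target word.
    """
    # Soft constraints seed the initial hypothesis rather than restricting search,
    # so the model may keep, move, or discard them during editing.
    hypothesis = list(constraints)

    for _ in range(max_iters):
        # 1. Reposition: each output slot selects an index into the current
        #    hypothesis; a negative index marks the token for deletion.
        indices = predict_reposition(source, hypothesis)
        hypothesis = [hypothesis[i] for i in indices if i >= 0]

        # 2. Placeholder insertion: one count per gap, including before the
        #    first token, so an empty hypothesis can still grow.
        counts = predict_placeholders(source, hypothesis)
        expanded: List[str] = ["<plh>"] * counts[0]
        for tok, n in zip(hypothesis, counts[1:]):
            expanded.append(tok)
            expanded.extend(["<plh>"] * n)

        # 3. Token prediction: fill all placeholders in parallel.
        filled = predict_tokens(source, expanded)
        new_hypothesis = [
            filled[i] if tok == "<plh>" else tok for i, tok in enumerate(expanded)
        ]

        # Stop once an iteration leaves the hypothesis unchanged.
        if new_hypothesis == hypothesis:
            break
        hypothesis = new_hypothesis

    return hypothesis
```

Because every edit head acts on all positions at once, each refinement pass is a constant number of parallel decoding steps, which is the source of the speedup over constrained beam search reported in the abstract.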

Similar Work