Insertion-based Decoding With Automatically Inferred Generation Order · The Large Language Model Bible Contribute to LLM-Bible

Insertion-based Decoding With Automatically Inferred Generation Order

Gu Jiatao, Liu Qi, Cho Kyunghyun. Arxiv 2019

[Paper]    
Applications GPT Model Architecture Pretraining Methods Reinforcement Learning Transformer

Conventional neural autoregressive decoding commonly assumes a fixed left-to-right generation order, which may be sub-optimal. In this work, we propose a novel decoding algorithm – InDIGO – which supports flexible sequence generation in arbitrary orders through insertion operations. We extend Transformer, a state-of-the-art sequence generation model, to efficiently implement the proposed approach, enabling it to be trained with either a pre-defined generation order or adaptive orders obtained from beam-search. Experiments on four real-world tasks, including word order recovery, machine translation, image caption and code generation, demonstrate that our algorithm can generate sequences following arbitrary orders, while achieving competitive or even better performance compared to the conventional left-to-right generation. The generated sequences show that InDIGO adopts adaptive generation orders based on input information.

Similar Work