Joint Copying And Restricted Generation For Paraphrase

Cao Ziqiang, Luo Chuwei, Li Wenjie, Li Sujian. Arxiv 2016

Applications, Attention Mechanism, Model Architecture, Training Techniques

Many natural language generation tasks, such as abstractive summarization and text simplification, are paraphrase-oriented. In these tasks, copying and rewriting are the two main writing modes. Most previous sequence-to-sequence (Seq2Seq) models use a single decoder and neglect this fact. In this paper, we develop a novel Seq2Seq model that fuses a copying decoder and a restricted generative decoder. The copying decoder finds the source position to copy from using a typical attention model. The generative decoder produces words restricted to a source-specific vocabulary. To combine the two decoders and determine the final output, we develop a predictor that predicts the writing mode (copying or rewriting). This predictor can be guided by the actual writing mode observed in the training data. We conduct extensive experiments on two different paraphrase datasets. The results show that our model outperforms state-of-the-art approaches in terms of both informativeness and language quality.
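The way the two decoders and the mode predictor fit together can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the predictor outputs a single logit that is squashed into a copy probability, and that the final word distribution is a soft mixture of the two decoders. All function names, the toy scores, and the example vocabulary are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a non-empty list of floats."""
    if not scores:
        raise ValueError("softmax needs at least one score")
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def copy_distribution(source_tokens, attention_scores):
    """Copying decoder (assumed form): attention over source positions,
    with the weights of repeated tokens summed together."""
    weights = softmax(attention_scores)
    dist = {}
    for token, weight in zip(source_tokens, weights):
        dist[token] = dist.get(token, 0.0) + weight
    return dist


def restricted_generation_distribution(gen_scores, allowed_vocab):
    """Generative decoder (assumed form): softmax computed only over
    words that belong to the source-specific vocabulary."""
    words = [w for w in gen_scores if w in allowed_vocab]
    probs = softmax([gen_scores[w] for w in words])
    return dict(zip(words, probs))


def combine(copy_dist, gen_dist, mode_logit):
    """Mode predictor (assumed form): sigmoid(mode_logit) is the
    probability of copying; the rest goes to rewriting."""
    copy_prob = 1.0 / (1.0 + math.exp(-mode_logit))
    words = set(copy_dist) | set(gen_dist)
    return {
        w: copy_prob * copy_dist.get(w, 0.0)
        + (1.0 - copy_prob) * gen_dist.get(w, 0.0)
        for w in words
    }


# Toy example with made-up scores.
source = ["the", "cat", "sat", "on", "the", "mat"]
attention = [0.1, 2.0, 0.5, 0.1, 0.1, 1.0]
gen_scores = {"kitten": 1.5, "rested": 1.0, "rug": 0.5, "banana": 3.0}
allowed = {"kitten", "rested", "rug"}  # hypothetical source-specific vocabulary

p_copy = copy_distribution(source, attention)
p_gen = restricted_generation_distribution(gen_scores, allowed)
p_final = combine(p_copy, p_gen, mode_logit=0.0)
```

In this sketch, "banana" receives no probability even though its raw score is the highest, because it falls outside the restricted vocabulary. A supervised variant could train `mode_logit` against whether the reference word at that step appears in the source, which is one reading of the abstract's remark that the predictor can be guided by the actual writing mode in the training data.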
