Memory And Knowledge Augmented Language Models For Inferring Salience In Long-form Stories

David Wilmot, Frank Keller. ArXiv 2021

Tags: Model Architecture, Pretraining Methods, Transformer

Measuring event salience is essential to the understanding of stories. This paper takes a recent unsupervised method for salience detection, derived from Barthes' Cardinal Functions and theories of surprise, and applies it to longer narrative forms. We improve the standard transformer language model by incorporating an external knowledgebase (derived from Retrieval Augmented Generation) and adding a memory mechanism to enhance performance on longer works. We use a novel approach to derive salience annotations from chapter-aligned summaries in the Shmoop corpus of classic literary works. Our evaluation against this data demonstrates that our salience detection model outperforms a language model without the knowledgebase and memory augmentations, both of which prove crucial to the improvement.
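The abstract does not spell the salience measure out, but the cardinal-function method it builds on scores an event by how much its removal degrades a language model's predictions of the text that follows. Below is a minimal deletion-test sketch using an off-the-shelf GPT-2; the function names, the sentence window, and the model choice are all illustrative assumptions, and the paper's actual model additionally layers RAG-style retrieval and a memory mechanism on top of the transformer:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

@torch.no_grad()
def continuation_nll(context: str, continuation: str) -> float:
    """Mean negative log-likelihood of `continuation` tokens given `context`."""
    # Prepend BOS so the context is never empty for the model.
    ctx = tokenizer(tokenizer.bos_token + context, return_tensors="pt").input_ids
    cont = tokenizer(continuation, return_tensors="pt").input_ids
    ids = torch.cat([ctx, cont], dim=1)
    logits = model(ids).logits
    n_ctx = ctx.shape[1]
    # Logits at position i predict token i+1, so shift by one and
    # score only the continuation tokens.
    preds = logits[0, n_ctx - 1 : -1]
    targets = ids[0, n_ctx:]
    nll = -torch.log_softmax(preds, dim=-1)[torch.arange(targets.shape[0]), targets]
    return nll.mean().item()

def salience(sentences: list[str], i: int, window: int = 3) -> float:
    """Deletion-based salience of sentence i: how much more surprising the
    next `window` sentences become when sentence i is removed from context."""
    following = " ".join(sentences[i + 1 : i + 1 + window])
    with_sent = " ".join(sentences[: i + 1])
    without_sent = " ".join(sentences[:i])
    return continuation_nll(without_sent, following) - continuation_nll(with_sent, following)

# Example: score each sentence of a short passage (hypothetical data).
story = [
    "Holmes examined the letter under the lamp.",
    "It was written in a trembling hand.",
    "Suddenly a shot rang out in the street below.",
    "Watson rushed to the window.",
    "The street was empty.",
]
scores = [salience(story, i, window=2) for i in range(len(story) - 2)]
```

A sentence whose removal makes the continuation markedly harder to predict acts like a Barthesian cardinal function (a plot-critical event) and receives a high score; the paper's contribution is making this kind of conditioning reliable over book-length context, where a vanilla transformer's window falls short.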
