Leveraging Pre-trained Language Models For Conversational Information Seeking From Text

Bellan Patrizio, Dragoni Mauro, Ghidini Chiara. arXiv 2022

[Paper]    
GPT, In-Context Learning, Model Architecture, Pretraining Methods, Prompting, RAG, Training Techniques, Transformer

Recent advances in Natural Language Processing, and in particular the construction of very large pre-trained language representation models, are opening up new perspectives on the construction of conversational information seeking (CIS) systems. In this paper we investigate the use of in-context learning and pre-trained language representation models to address the problem of information extraction from process description documents, in an incremental, question-and-answering oriented fashion. In particular, we investigate the use of the native GPT-3 (Generative Pre-trained Transformer 3) model, together with two in-context learning customizations that inject conceptual definitions and a limited number of samples in a few-shot learning fashion. The results highlight the potential of the approach and the usefulness of the in-context learning customizations, which can substantially contribute to addressing the "training data challenge" of deep learning based NLP techniques in the BPM field. They also highlight the challenge posed by control flow relations, for which further training needs to be devised.
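As a rough illustration of the kind of in-context learning customization described in the abstract, the sketch below builds a prompt that prepends a conceptual definition of a process element and a few example question-answer pairs before asking a new question about a process description. This is a minimal, hypothetical sketch: the exact definitions, examples, questions, and the GPT-3 invocation used by the authors are assumptions, not the paper's actual prompts or code.

```python
# Illustrative sketch of a few-shot, definition-injected prompt for
# extracting process information from a textual process description.
# The definition text, examples, and the query_gpt3 placeholder are
# hypothetical; they only show the general prompt structure.

CONCEPT_DEFINITION = (
    "An activity is a unit of work performed by a participant "
    "within a business process."
)

FEW_SHOT_EXAMPLES = [
    {
        "description": "The clerk checks the invoice and then archives it.",
        "question": "Which activities are mentioned?",
        "answer": "checks the invoice; archives it",
    },
    {
        "description": "After approval, the manager notifies the customer.",
        "question": "Which activities are mentioned?",
        "answer": "approval; notifies the customer",
    },
]


def build_prompt(description: str, question: str) -> str:
    """Assemble definition, few-shot samples, and the new query into one prompt."""
    parts = [f"Definition: {CONCEPT_DEFINITION}", ""]
    for ex in FEW_SHOT_EXAMPLES:
        parts += [
            f"Process description: {ex['description']}",
            f"Question: {ex['question']}",
            f"Answer: {ex['answer']}",
            "",
        ]
    parts += [
        f"Process description: {description}",
        f"Question: {question}",
        "Answer:",
    ]
    return "\n".join(parts)


if __name__ == "__main__":
    prompt = build_prompt(
        "The warehouse worker picks the items and hands them to the courier.",
        "Which activities are mentioned?",
    )
    print(prompt)
    # In practice, this prompt would be sent to GPT-3 (e.g. via the OpenAI
    # completions API) and the generated continuation would be parsed as the
    # extracted answer. That call is omitted here to keep the sketch runnable.
```

In an incremental question-and-answering setting, the same prompt scaffold would be reused with a sequence of questions (participants, activities, control flow relations), each answer feeding the next extraction step.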

Similar Work