Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching

Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao. Arxiv 2024

[Paper]
Applications Attention Mechanism Few Shot Fine Tuning Model Architecture Pretraining Methods Prompting RAG Training Techniques

The recent success of Large Language Models (LLMs) has garnered significant attention in both academia and industry. Prior research on LLMs has primarily focused on enhancing or leveraging their generalization capabilities in zero- and few-shot settings. However, there has been limited investigation into effectively fine-tuning LLMs for a specific natural language understanding task in supervised settings. In this study, we conduct an experimental analysis by fine-tuning LLMs for the task of Chinese short text matching. We explore various factors that influence performance when fine-tuning LLMs, including task modeling methods, prompt formats, and output formats.

The Large Language Model Bible

Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching

Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao. Arxiv 2024

Similar Work