An Expert Is Worth One Token: Synergizing Multiple Expert LLMs As Generalist Via Expert Token Routing

Chai Ziwei, Wang Guoyin, Su Jing, Zhang Tianjie, Huang Xuanwen, Wang Xuwu, Xu Jingjing, Yuan Jianbo, Yang Hongxia, Wu Fei, Yang Yang. arXiv 2024

We present Expert-Token-Routing, a unified generalist framework that facilitates seamless integration of multiple expert LLMs. Our framework represents expert LLMs as special expert tokens within the vocabulary of a meta LLM. The meta LLM can route to an expert LLM in the same way it generates any new token. Expert-Token-Routing not only supports learning the implicit expertise of expert LLMs from existing instruction datasets but also allows new expert LLMs to be added dynamically in a plug-and-play manner. It also conceals the detailed collaboration process from the user, so that interaction feels like using a single LLM. Our framework outperforms various existing multi-LLM collaboration paradigms across benchmarks that incorporate six diverse expert domains, demonstrating its effectiveness and robustness in building a generalist LLM system by synergizing multiple expert LLMs.
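
The routing idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the class `ExpertTokenRouter`, the token names such as `<expert_math>`, the hand-picked embedding vectors, and the lambda "experts" are all hypothetical stand-ins. The sketch only assumes that each expert is one extra vocabulary entry, that the meta LLM picks its next token by scoring a hidden state against token embeddings, and that picking an expert token hands the query to that expert. How the expert token embeddings are learned is not shown.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Inner product used as the toy next-token score."""
    return sum(x * y for x, y in zip(a, b))


@dataclass
class ExpertTokenRouter:
    # Toy stand-in for the meta LLM's output vocabulary: token -> embedding.
    token_embeddings: Dict[str, Vector]
    # Expert token -> callable standing in for an expert LLM (hypothetical).
    experts: Dict[str, Callable[[str], str]] = field(default_factory=dict)

    def add_expert(self, token: str, embedding: Vector,
                   expert: Callable[[str], str]) -> None:
        """Plug in a new expert by adding one token to the vocabulary."""
        self.token_embeddings[token] = embedding
        self.experts[token] = expert

    def next_token(self, hidden: Vector) -> str:
        """Greedy next-token choice over ordinary and expert tokens alike."""
        return max(self.token_embeddings,
                   key=lambda t: dot(hidden, self.token_embeddings[t]))

    def respond(self, query: str, hidden: Vector,
                fallback: Callable[[str], str]) -> str:
        """Route to an expert if an expert token is generated, else answer directly."""
        token = self.next_token(hidden)
        if token in self.experts:
            return self.experts[token](query)
        return fallback(query)


# Assumed toy setup: one ordinary token and two expert tokens.
router = ExpertTokenRouter(token_embeddings={"the": [1.0, 0.0, 0.0]})
router.add_expert("<expert_math>", [0.0, 1.0, 0.0], lambda q: f"[math expert] {q}")
router.add_expert("<expert_law>", [0.0, 0.0, 1.0], lambda q: f"[law expert] {q}")

# The hidden state is hand-picked here; a real meta LLM would compute it from the query.
answer = router.respond("Integrate x^2", hidden=[0.1, 0.9, 0.2],
                        fallback=lambda q: f"[meta LLM] {q}")
print(answer)
```

In this sketch the caller sees only the final string, which mirrors the abstract's point that the collaboration is hidden from the user, and `add_expert` mirrors the plug-and-play extension: adding an expert touches one vocabulary entry and leaves the rest unchanged.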
