LLM tasks routing

#2
by mrs83 - opened

๐Ÿ˜ƒ: A multi-model hybrid architecture that uses multiple small language models for different tasks. The tasks are routed through a classifier
๐Ÿค–: Final answer:
Based on my research, I can provide you with information about implementing a multi-model hybrid architecture that uses multiple small language models for different tasks, with routing through a classifier.

We looked for frameworks and models that implement classifier-based routing between small language models, focusing on Hugging Face resources and recent developments in this area.

Our key findings show that while there isn't a direct off-the-shelf solution on Hugging Face that exactly matches your requirements, there are several components you can combine to build such a system:

Routing Frameworks

RouteLLM, developed by LMSYS, is a framework specifically designed for routing queries between different language models. It implements several router approaches, including a BERT classifier and a causal LLM classifier that can determine which mod...

Sign up or log in to comment