LLM tasks routing
User: A multi-model hybrid architecture that uses multiple small language models for different tasks. The tasks are routed through a classifier.
Assistant: Final answer:
Here is what I found on implementing a multi-model hybrid architecture in which a classifier routes each task to one of several small language models.
I looked for frameworks and models that implement classifier-based routing between small language models, focusing on Hugging Face resources and recent developments in this area.
I did not find an off-the-shelf solution on Hugging Face that exactly matches your requirements, but there are several components you can combine to build such a system:
Routing Frameworks
RouteLLM, developed by LMSYS, is a framework specifically designed for routing queries between different language models. It implements several router approaches, including a BERT classifier and a causal LLM classifier that can determine which mod...
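To make the classifier-based routing idea concrete, here is a minimal sketch in plain Python. It does not use RouteLLM or any of its APIs. The task labels, keyword sets, `classify`, `Router`, and the stub model names are all hypothetical, and the keyword scorer is only a stand-in for a trained classifier such as a fine-tuned BERT model.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet

# Hypothetical task labels and keywords. In a real system a trained
# classifier would replace this keyword scoring.
TASK_KEYWORDS: Dict[str, FrozenSet[str]] = {
    "code": frozenset({"function", "bug", "python", "compile"}),
    "math": frozenset({"solve", "equation", "integral", "derivative"}),
    "summarize": frozenset({"summarize", "summary", "shorten"}),
}
DEFAULT_TASK = "general"


def classify(query: str) -> str:
    """Return the task whose keyword set overlaps most with the query words."""
    words = set(re.findall(r"[a-z]+", query.lower()))
    scores = {task: len(words & kws) for task, kws in TASK_KEYWORDS.items()}
    best_task, best_score = max(scores.items(), key=lambda item: item[1])
    # Fall back to the general model when no keyword matches at all.
    return best_task if best_score > 0 else DEFAULT_TASK


@dataclass
class Router:
    """Dispatches each query to the handler registered for its task label."""
    handlers: Dict[str, Callable[[str], str]]

    def route(self, query: str) -> str:
        task = classify(query)
        handler = self.handlers.get(task, self.handlers[DEFAULT_TASK])
        return handler(query)


def make_stub(name: str) -> Callable[[str], str]:
    # Stand-in for a call to a small language model: returns a tagged echo.
    return lambda query: f"[{name}] {query}"


router = Router(handlers={
    "code": make_stub("code-model"),
    "math": make_stub("math-model"),
    "summarize": make_stub("summary-model"),
    DEFAULT_TASK: make_stub("general-model"),
})

if __name__ == "__main__":
    print(router.route("Fix the bug in this Python function"))
    print(router.route("Hello there"))
```

Keeping the classifier separate from the handler table means either side can be swapped independently: the scorer could be replaced by a learned model, and each stub by a call to an actual small language model, without changing `Router.route`.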