This repository contains the trained PersonalizedRouter model weights saved as a .pth file.
In the project files, the suffix v1 refers to the Multi-cost-efficiency Simulation Strategy described in the paper, while v2 refers to the LLM-as-a-Judge Simulation Strategy.
For best_model_v1.pth, the model was trained on an interaction dataset generated by 10 LLMs, 240 queries, and 9 different performance and cost settings.
For best_model_v2.pth, the model was trained on an interaction dataset generated by 10 LLMs, 240 queries, and preferences from 9 different user groups.
	Inference Providers
	NEW
	
	
	This model isn't deployed by any Inference Provider.
	๐
			
		Ask for provider support