This model was trained for the cross-domain generalisation experiments in the Reasoning Gym paper.

Format: Safetensors
Model size: 3.09B params
Tensor type: F32
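
The parameter count and tensor type listed above give a rough idea of how much memory the weights alone need. The sketch below is back-of-the-envelope arithmetic, not a measurement: it assumes 4 bytes per F32 parameter, ignores activations, KV cache, and framework overhead, and the BF16 figure is a hypothetical comparison (the listing only states F32). The helper name `weight_memory_gb` is illustrative.

```python
# Rough weight-memory estimate from the listed parameter count and tensor type.
# Bytes per element for a few common dtypes; only F32 is stated in the listing.
BYTES_PER_DTYPE = {"F32": 4, "BF16": 2, "F16": 2}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return approximate weight storage in gigabytes (10**9 bytes)."""
    return num_params * BYTES_PER_DTYPE[dtype] / 1e9


NUM_PARAMS = 3.09e9  # "3.09B params" from the listing

f32_gb = weight_memory_gb(NUM_PARAMS, "F32")
# Assumption: casting to BF16 at load time, which the listing does not mention.
bf16_gb = weight_memory_gb(NUM_PARAMS, "BF16")

print(f"F32: ~{f32_gb:.2f} GB, BF16 (hypothetical cast): ~{bf16_gb:.2f} GB")
```

Under these assumptions the F32 weights come to roughly 12.4 GB, so loading in a 16-bit type would about halve that if the reduced precision is acceptable for the intended use.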

Model tree for OllieStanley/Qwen2.5-3B-Instruct-RG-Games

Base model: Qwen/Qwen2.5-3B
Finetuned: this model