Trained for cross-domain generalisation experiments for the Reasoning Gym paper.

Downloads last month
4
Safetensors
Model size
3.09B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for OllieStanley/Qwen2.5-3B-Instruct-RG-Logic

Base model

Qwen/Qwen2.5-3B
Finetuned
(623)
this model