Kaizen GRPO Model

Fine-tuned Qwen2.5-3B-Instruct with GRPO for OS management tasks.

Downloads last month
-
Safetensors
Model size
3B params
Tensor type
F16
·
Inference Providers NEW
Input a message to start chatting with NehaChikle/kaizen-grpo.

Space using NehaChikle/kaizen-grpo 1