SWERL Qwen3 8B Termigen GRPO

Final checkpoint from the hamishivi/agent-task-termigen GRPO run.

  • Base model: hamishivi/sft_qwen3_8b_our_sft
  • Run ID: swerl_qwen3_8b_our_sft_agent_task_termigen_grpo__seed42
  • Run name: swerl_qwen3_8b_our_sft_agent_task_termigen_grpo__42__1779039229
  • Training completed: 2026-05-18

This checkpoint is intended for internal evaluation and continuation experiments.

Downloads last month
320
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
Input a message to start chatting with wAI-org/swerl-qwen3-8b-termigen-grpo.

Model tree for wAI-org/swerl-qwen3-8b-termigen-grpo

Finetuned
(4)
this model