Llama_3.1_8B_GRPO / pytorch_model-00002-of-00004.bin

Commit History

Trained with Unsloth
2774f3f
verified

colesmcintosh commited on