llama3.1-8B-gsm8k-grpo / pytorch_model-00002-of-00004.bin

Commit History

Trained with Unsloth
b42217f
verified

ubermenchh commited on