niklasm222/Qwen2.5-3B-Instruct-1K_subset-GRPO-gsm8k-prolog-prover-v1 Text Generation • Updated 24 days ago • 21