A common formula for DIYing an LLM: post-train a Qwen model on a dataset distilled from DeepSeek.
Scaling test-time compute: Enhance math problem solving by scaling test-time compute