rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 4 days ago • 185
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 8 days ago • 72
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 77
cognitivecomputations/OpenCoder-LLM_opc-sft-stage1-DolphinLabeled Viewer • Updated 5 days ago • 3.01M • 34 • 6
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 13 days ago • 22
MahmoudAshraf/mms-300m-1130-forced-aligner Automatic Speech Recognition • Updated Sep 28, 2024 • 1.9M • 36
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper • 2412.18319 • Published 19 days ago • 35