rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking • Paper • 2501.04519 • Published 6 days ago • 209
Offline Reinforcement Learning for LLM Multi-Step Reasoning • Paper • 2412.16145 • Published 25 days ago • 38
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters • Paper • 2408.03314 • Published Aug 6, 2024 • 54
GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models • Paper • 2411.05830 • Published Nov 5, 2024 • 20
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 • Text Generation • Updated Oct 10, 2024 • 16.6k • 28