MathTutorBench: A Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors Paper • 2502.18940 • Published Feb 26 • 1
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning Paper • 2411.04983 • Published Nov 7, 2024 • 12
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 272
Aligning Large Language Models via Self-Steering Optimization Paper • 2410.17131 • Published Oct 22, 2024 • 23
Qwen2-Math Collection Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 51
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 7 days ago • 162