AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions • Paper • 2312.08472 • Published Dec 13, 2023 • 2
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? • Paper • 2403.14624 • Published Mar 21 • 49
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline • Paper • 2404.02893 • Published Apr 3 • 19
A Careful Examination of Large Language Model Performance on Grade School Arithmetic • Paper • 2405.00332 • Published 8 days ago • 21