Forward-Backward Reasoning in Large Language Models for Mathematical Verification Paper • 2308.07758 • Published Aug 15, 2023 • 4
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Paper • 2309.10814 • Published Sep 19, 2023 • 3
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning Paper • 2310.03731 • Published Oct 5, 2023 • 25
SCREWS: A Modular Framework for Reasoning with Revisions Paper • 2309.13075 • Published Sep 20, 2023 • 15
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning Paper • 2309.05653 • Published Sep 11, 2023 • 9
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct Paper • 2308.09583 • Published Aug 18, 2023 • 7
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models Paper • 2309.12284 • Published Sep 21, 2023 • 16
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving Paper • 2309.17452 • Published Sep 29, 2023 • 1
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification Paper • 2308.07921 • Published Aug 15, 2023 • 20
Improving Length-Generalization in Transformers via Task Hinting Paper • 2310.00726 • Published Oct 1, 2023 • 1
Improving Large Language Model Fine-tuning for Solving Math Problems Paper • 2310.10047 • Published Oct 16, 2023 • 5
Extracting Mathematical Concepts with Large Language Models Paper • 2309.00642 • Published Aug 29, 2023 • 1
ComputeGPT: A computational chat model for numerical problems Paper • 2305.06223 • Published May 8, 2023 • 1
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks Paper • 2211.12588 • Published Nov 22, 2022 • 3
Structured Chain-of-Thought Prompting for Code Generation Paper • 2305.06599 • Published May 11, 2023 • 1
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting Paper • 2305.07004 • Published May 11, 2023 • 1
SelfzCoT: a Self-Prompt Zero-shot CoT from Semantic-level to Code-level for a Better Utilization of LLMs Paper • 2305.11461 • Published May 19, 2023 • 1
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning Paper • 2305.18170 • Published May 29, 2023 • 2
Learning Multi-Step Reasoning by Solving Arithmetic Tasks Paper • 2306.01707 • Published Jun 2, 2023 • 1
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models Paper • 2308.01825 • Published Aug 3, 2023 • 19
FACT: Learning Governing Abstractions Behind Integer Sequences Paper • 2209.09543 • Published Sep 20, 2022 • 1
System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 38
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models Paper • 2312.06585 • Published Dec 11, 2023 • 26
Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math Paper • 2312.17120 • Published Dec 28, 2023 • 24
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent Paper • 2312.08926 • Published Dec 14, 2023 • 7
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?" Paper • 2311.07587 • Published Nov 8, 2023 • 3
Leveraging Large Language Models for Automated Proof Synthesis in Rust Paper • 2311.03739 • Published Nov 7, 2023 • 5
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning Paper • 2308.00436 • Published Aug 1, 2023 • 20
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5 • 61
Augmenting Math Word Problems via Iterative Question Composing Paper • 2401.09003 • Published Jan 17 • 2
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning Paper • 2402.06332 • Published Feb 9 • 17
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline Paper • 2401.08190 • Published Jan 16
Common 7B Language Models Already Possess Strong Math Capabilities Paper • 2403.04706 • Published Mar 7 • 16
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs Paper • 2402.16352 • Published Feb 26 • 1
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset Paper • 2402.10176 • Published Feb 15 • 33
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper • 2405.00332 • Published about 1 month ago • 24
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning Paper • 2405.07551 • Published 19 days ago
Transformers Can Do Arithmetic with the Right Embeddings Paper • 2405.17399 • Published 4 days ago • 44
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data Paper • 2405.14333 • Published 9 days ago • 27