Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ashioyajotham 's Collections
Fav papers
LLM Reasoning
safety
finetuning
Scale
VLMs

LLM Reasoning

updated 12 days ago
Upvote
-

  • Teaching Large Language Models to Reason with Reinforcement Learning

    Paper • 2403.04642 • Published Mar 7, 2024 • 51

  • How Far Are We from Intelligent Visual Deductive Reasoning?

    Paper • 2403.04732 • Published Mar 7, 2024 • 24

  • Common 7B Language Models Already Possess Strong Math Capabilities

    Paper • 2403.04706 • Published Mar 7, 2024 • 21

  • DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

    Paper • 2405.14333 • Published May 23, 2024 • 41

  • Towards General-Purpose Model-Free Reinforcement Learning

    Paper • 2501.16142 • Published Jan 27 • 30

  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28 • 121

  • FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

    Paper • 2505.02735 • Published 13 days ago • 27
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs