Reasoning - a theainerd Collection

theainerd 's Collections

Agents

Models

Reasoning

updated Jan 26

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 78
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 57
Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106
Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 100
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 338

Note Must Read
Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published Jan 20 • 32