AI Papers Academy

aipapersacademy

AI & ML interests

None yet

Organizations

None yet

aipapersacademy's activity

commented on a paper 16 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 23 days ago • 116

commented on 2 papers about 1 month ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 105

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 73

commented on 2 papers about 2 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 111

LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published Feb 12 • 28

commented on 2 papers 2 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 116

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 379

commented on a paper 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275

commented on 2 papers 4 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 97

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 82

commented on 3 papers 5 months ago

upvoted a paper 7 months ago

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27, 2024 • 143

commented on a paper 7 months ago

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27, 2024 • 143

upvoted a paper 8 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 92

commented on 2 papers 8 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 92

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Paper • 2407.19985 • Published Jul 29, 2024 • 37

upvoted a paper 10 months ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 60

commented on a paper 10 months ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 60