deepseek-ai/DeepSeek-R1-Distill-Llama-8B Text Generation • Updated 24 days ago • 1.55M • • 655
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 352