DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community! deepseek-ai deepseek-ai/DeepSeek-R1
β¨ MIT License : enabling distillation for custom models β¨ 32B & 70B models match OpenAI o1-mini in multiple capabilities β¨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'