Post
2701
BIG release by DeepSeek AIπ₯π₯π₯
DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co./deepseek-ai
deepseek-ai/DeepSeek-R1
β¨ MIT License : enabling distillation for custom models
β¨ 32B & 70B models match OpenAI o1-mini in multiple capabilities
β¨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'
DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co./deepseek-ai
deepseek-ai/DeepSeek-R1
β¨ MIT License : enabling distillation for custom models
β¨ 32B & 70B models match OpenAI o1-mini in multiple capabilities
β¨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'