Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
AdinaYΒ 
posted an update 4 days ago
Post
2701
BIG release by DeepSeek AIπŸ”₯πŸ”₯πŸ”₯

DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co./deepseek-ai
deepseek-ai/DeepSeek-R1

✨ MIT License : enabling distillation for custom models
✨ 32B & 70B models match OpenAI o1-mini in multiple capabilities
✨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'
In this post