deepseek-ai/DeepSeek-R1-Distill-Qwen-32B • Text Generation • Updated 5 days ago • 1.26M • 1.2k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B • Text Generation • Updated 5 days ago • 463k • 604
deepseek-ai/DeepSeek-R1-Distill-Llama-8B • Text Generation • Updated 5 days ago • 1.32M • 614
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B • Text Generation • Updated 5 days ago • 1.26M • 955
Running • 526 • Scaling test-time compute 📈 • Enhance math problem solving by scaling test-time compute