DeepSeek-R1 Collection by deepseek-ai 3 days ago 104 deepseek-ai/DeepSeek-R1 Text Generation • Updated about 10 hours ago • 44.6k • 2k deepseek-ai/DeepSeek-R1-Zero Text Generation • Updated about 10 hours ago • 3.04k • 371 deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • Updated about 10 hours ago • 10.5k • 195 deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated about 10 hours ago • 63.7k • 392
DeepSeek R1 (All Versions) DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. Collection by unsloth 2 days ago 53 unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF Updated 2 days ago • 37.4k • 90 unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF Updated 3 days ago • 21.8k • 49 unsloth/DeepSeek-R1-Distill-Qwen-14B-GGUF Updated 3 days ago • 14.7k • 32 unsloth/DeepSeek-R1-Distill-Qwen-7B-GGUF Updated 3 days ago • 14.2k • 40
SmolVLM 256M & 500M Collection for models & demos for even smoller SmolVLM release Collection by HuggingFaceTB about 6 hours ago 24 HuggingFaceTB/SmolVLM-256M-Instruct Image-Text-to-Text • Updated about 6 hours ago • 244 • 23 HuggingFaceTB/SmolVLM-500M-Instruct Image-Text-to-Text • Updated about 6 hours ago • 88 • 19 Running on Zero 13 📊 SmolVLM HuggingFaceTB/SmolVLM-256M-Base Image-Text-to-Text • Updated 3 days ago • 56
Cosmos The collection of Cosmos models Collection by nvidia 7 days ago 246 nvidia/Cosmos-1.0-Guardrail Updated 14 days ago • 5.44k • 41 nvidia/Cosmos-1.0-Autoregressive-4B Updated 14 days ago • 2.11k • 46
DeepSeek-V3 Collection by deepseek-ai 18 days ago 130 deepseek-ai/DeepSeek-V3-Base Updated 25 days ago • 19.8k • 1.3k deepseek-ai/DeepSeek-V3 Updated 25 days ago • 200k • 2.2k DeepSeek-V3 Technical Report Paper • 2412.19437 • Published 28 days ago • 27
Meta's Llama 3.2 language models & evals Collection by meta-llama Dec 13, 2024 50 meta-llama/Llama-3.2-1B Text Generation • Updated Oct 24, 2024 • 1.26M • 1.48k meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated Oct 24, 2024 • 1.15M • 715 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated Oct 24, 2024 • 1.51M • 919 meta-llama/Llama-3.2-3B Text Generation • Updated Oct 24, 2024 • 427k • 466
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen Nov 28, 2024 473 Running 613 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated Sep 25, 2024 • 337k • 162 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated Sep 25, 2024 • 581k • 190 Qwen/Qwen2.5-1.5B Text Generation • Updated Oct 8, 2024 • 91.2k • 56
GTE ModernBERT GTE Models Based on ModernBERT Collection by Alibaba-NLP 2 days ago 10 Alibaba-NLP/gte-modernbert-base Sentence Similarity • Updated 1 day ago • 519 • 54 Alibaba-NLP/gte-reranker-modernbert-base Sentence Similarity • Updated about 12 hours ago • 219 • 33
Jan 17 Releases ❄️ Models and datasets of the second week of Jan 2025. Collection by merve 6 days ago 10 openbmb/MiniCPM-o-2_6 Any-to-Any • Updated about 11 hours ago • 50.8k • 788 MiniMaxAI/MiniMax-Text-01 Text Generation • Updated 7 days ago • 3.94k • 469 OuteAI/OuteTTS-0.3-1B Text-to-Speech • Updated 6 days ago • 8.11k • 80 NovaSky-AI/Sky-T1_data_17k Viewer • Updated 9 days ago • 16.4k • 2.86k • 143
Phi-4 (All Versions) Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. Collection by unsloth 3 days ago 35 unsloth/phi-4-GGUF Text Generation • Updated 10 days ago • 61.2k • 121 unsloth/phi-4-unsloth-bnb-4bit Text Generation • Updated 10 days ago • 41.5k • 33 unsloth/phi-4 Text Generation • Updated 10 days ago • 14.7k • 65 unsloth/phi-4-bnb-4bit Text Generation • Updated 10 days ago • 3.02k • 10