Running 1.79k 1.79k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Mistral-Small-24B-2501 (All Versions) Collection A collection of Mistral's new Small 2501 models including GGUF, 4-bit and more! • 9 items • Updated 1 day ago • 5
Erland/Mistral-Small-24B-Base-ChatML-2501-bnb-4bit Text Generation • Updated 26 days ago • 98 • 2
Erland/Mistral-Small-24B-Base-ChatML-2501-bnb-4bit Text Generation • Updated 26 days ago • 98 • 2
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 1 day ago • 202