Running 1.79k 1.79k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • Updated 5 days ago • 1.26M • • 954
intfloat/multilingual-e5-large-instruct Feature Extraction • Updated 12 days ago • 643k • • 350
meta-llama/Llama-3.2-3B-Instruct-QLORA_INT4_EO8 Text Generation • Updated Nov 18, 2024 • 1.87k • 64
view article Article MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR By abhinand • Oct 20, 2024 • 35