Running 1.79k 1.79k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 790
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 28 days ago • 40
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated 3 days ago • 14
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 151