Running 1.78k 1.78k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 14 days ago • 50
Congliu/Chinese-DeepSeek-R1-Distill-data-110k Viewer • Updated 8 days ago • 110k • 4.77k • 404
CodeI/O Collection Collection for CodeI/O @ https://codei-o.github.io/ • 15 items • Updated 16 days ago • 6
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published 17 days ago • 45
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published 21 days ago • 41