SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated about 17 hours ago • 33
AceMath Collection We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 6 days ago • 6
SwiftKV Models Collection SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. • 4 items • Updated about 8 hours ago • 5
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 19 items • Updated 16 days ago • 87
Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated 7 days ago • 30
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 16 days ago • 245
Riva Collection A family of Riva production (NVAIE) speech models that achieve state-of-the-art results on speech transcription, translation, and synthesis tasks. • 1 item • Updated 7 days ago • 3
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 26 days ago • 11