SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 5 days ago • 140
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 3 days ago • 222
ibm-granite/granite-vision-3.1-2b-preview Image-Text-to-Text • Updated about 20 hours ago • 1.74k • 31
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 9 days ago • 31
alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Beta Image-to-Image • Updated Oct 12, 2024 • 13.1k • 269