view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 790
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 3 days ago • 102
Sa2VA Model Zoo Collection Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research • 4 items • Updated 20 days ago • 30
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • Jan 3 • 35
GLiNER Collection Knowledgator GLiNER models for information extraction • 8 items • Updated Dec 9, 2024 • 9
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 139
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 8 days ago • 34
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 16 days ago • 91
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 143