HuggingFaceTB/SmolVLM2-500M-Video-Instruct • Video-Text-to-Text • Updated 2 days ago • 2.99k • 34
SmolVLM • Collection • State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 9 days ago • 34
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature • Paper • 2501.07171 • Published Jan 13 • 50