LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 20 days ago • 48
laion/CLIP-ViT-H-14-laion2B-s32B-b79K Zero-Shot Image Classification • Updated 5 days ago • 1.49M • 350
alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Beta Image-to-Image • Updated Oct 12, 2024 • 12k • 259