LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 114
apple/aimv2-large-patch14-336-distilled Image Feature Extraction • Updated about 2 hours ago • 96 • 3
apple/aimv2-large-patch14-224-lit Zero-Shot Image Classification • Updated about 2 hours ago • 654 • 5