Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
AutoTrain Compatible
custom_code
4-bit precision
Merge
Eval Results
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
368
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
GoodiesHere/Apollo-LMMs-Apollo-7B-t32
Video-Text-to-Text
•
Updated
7 days ago
•
213
•
29
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
Updated
18 days ago
•
2.46M
•
•
975
jinaai/jina-clip-v2
Feature Extraction
•
Updated
11 days ago
•
20.6k
•
142
GoodiesHere/Apollo-LMMs-Apollo-3B-t32
Text Generation
•
Updated
7 days ago
•
115
•
14
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
18 days ago
•
769k
•
332
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
Updated
18 days ago
•
89.8k
•
229
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10
•
240k
•
470
prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
Image-Text-to-Text
•
Updated
4 days ago
•
158
•
9
AI-Safeguard/Ivy-VL-llava
Visual Question Answering
•
Updated
4 days ago
•
1.69k
•
53
NexaAIDev/OmniVLM-968M
Updated
8 days ago
•
4.52k
•
483
bartowski/Qwen2-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
7 days ago
•
5.35k
•
16
GoodiesHere/Apollo-LMMs-Apollo-1_5B-t32
Video-Text-to-Text
•
Updated
7 days ago
•
174
•
6
robotics-diffusion-transformer/rdt-1b
Robotics
•
Updated
Oct 17
•
3.6k
•
55
openvla/openvla-7b
Image-Text-to-Text
•
Updated
Sep 16
•
91.3k
•
87
bartowski/Qwen2-VL-2B-Instruct-GGUF
Image-Text-to-Text
•
Updated
7 days ago
•
2.13k
•
9
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
Oct 14
•
20.9k
•
597
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
Oct 18
•
10k
•
758
qnguyen3/nanoLLaVA
Text Generation
•
Updated
Oct 27
•
23.4k
•
150
chenjoya/videollm-online-8b-v1plus
Video-Text-to-Text
•
Updated
Jul 13
•
5.55k
•
17
qnguyen3/nanoLLaVA-1.5
Image-Text-to-Text
•
Updated
Sep 21
•
482
•
104
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
Sep 21
•
22.2k
•
19
Qwen/Qwen2-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Sep 25
•
47.8k
•
40
allenai/Molmo-72B-0924
Image-Text-to-Text
•
Updated
Oct 10
•
3.41k
•
272
ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-MUV-101
Updated
Nov 1
•
37
•
2
ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-SIDER-101
Updated
Nov 1
•
31
•
2
NCSOFT/VARCO-VISION-14B-HF
Image-Text-to-Text
•
Updated
15 days ago
•
1.13k
•
20
nvidia/NVLM-D-72B-mcore
Image-Text-to-Text
•
Updated
4 days ago
•
2
imageomics/bioclip
Zero-Shot Image Classification
•
Updated
May 17
•
5.52k
•
42
HuggingFaceM4/idefics-80b
Text Generation
•
Updated
Oct 12, 2023
•
67
•
67
HuggingFaceM4/idefics-9b-instruct
Text Generation
•
Updated
Oct 12, 2023
•
3.65k
•
104
Previous
1
2
3
...
13
Next