leafspark committed
Commit 9918d3f (verified) · Parent: e9f31f9

docs: add model card

Files changed (1): README.md (+7, -1)
README.md CHANGED
@@ -2,4 +2,10 @@
  license: llama3.2
  base_model:
  - meta-llama/Llama-3.2-11B-Vision-Instruct
- ---
+ ---
+
+ # Llama-3.2-11B-Vision-Instruct-GGUF
+
+ Sourced from [Ollama](https://ollama.com/library/llama3.2-vision).
+
+ The Llama 3.2-Vision collection of multimodal large language models (LLMs) comprises pretrained and instruction-tuned image-reasoning generative models in 11B and 90B sizes (text + images in, text out). The instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image, and they outperform many available open-source and closed multimodal models on common industry benchmarks.
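
As a usage sketch (not part of the commit): the GGUF weights in a repository like this one can be fetched with the `huggingface_hub` client. The repo id below is inferred from this repository's name and the filename is hypothetical; substitute a real file listed under "Files and versions".

```python
# Minimal sketch: download a GGUF file from the Hub with huggingface_hub.
# Assumptions: the repo id is inferred from this repository; the filename is
# hypothetical -- replace it with an actual file from "Files and versions".
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="leafspark/Llama-3.2-11B-Vision-Instruct-GGUF",  # assumed repo id
    filename="Llama-3.2-11B-Vision-Instruct.Q4_K_M.gguf",    # hypothetical filename
)
print(path)  # local cache path of the downloaded file
```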