leafspark committed
Commit 9918d3f (verified) · Parent: e9f31f9

docs: add model card

Files changed (1): README.md (+7, -1)
README.md CHANGED
@@ -2,4 +2,10 @@
  license: llama3.2
  base_model:
  - meta-llama/Llama-3.2-11B-Vision-Instruct
- ---
+ ---
+
+ # Llama-3.2-11B-Vision-Instruct-GGUF
+
+ Sourced from [Ollama](https://ollama.com/library/llama3.2-vision).
+
+ The Llama 3.2-Vision collection of multimodal large language models (LLMs) comprises pretrained and instruction-tuned image-reasoning generative models in 11B and 90B sizes (text + images in, text out). The instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image, and they outperform many available open-source and closed multimodal models on common industry benchmarks.
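
As a usage sketch (not part of the commit): the GGUF weights in a repository like this one can be fetched with the `huggingface_hub` client. The repo id below is inferred from this repository's name and the filename is hypothetical; substitute a real file listed under "Files and versions".

```python
# Minimal sketch: download a GGUF file from the Hub with huggingface_hub.
# Assumptions: the repo id is inferred from this repository; the filename is
# hypothetical -- replace it with an actual file from "Files and versions".
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="leafspark/Llama-3.2-11B-Vision-Instruct-GGUF",  # assumed repo id
    filename="Llama-3.2-11B-Vision-Instruct.Q4_K_M.gguf",    # hypothetical filename
)
print(path)  # local cache path of the downloaded file
```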