somaniarushi
committed on
Commit 04ceb13 • 1 Parent(s): b5c3f72
Update README.md
README.md
CHANGED
@@ -3,7 +3,7 @@ license: cc
 ---
 # Fuyu-8B Model Card
 
-We’re releasing Fuyu-8B, a small version of the
+We’re releasing Fuyu-8B, a small version of the multimodal model that powers our product. The model is available on HuggingFace. We think Fuyu-8B is exciting because:
 1. It has a much simpler architecture and training procedure than other multi-modal models, which makes it easier to understand, scale, and deploy.
 2. It’s designed from the ground up for digital agents, so it can support arbitrary image resolutions, answer questions about graphs and diagrams, answer UI-based questions, and do fine-grained localization on screen images.
 3. It’s fast - we can get responses for large images in less than 100 milliseconds.