---
license: gpl-3.0
datasets:
- JosephusCheung/GuanacoVQADataset
language:
- en
- zh
- ja
- de
pipeline_tag: visual-question-answering
---
This model is a work in progress, and the current release does not represent its final quality.
Alignment for multilingual VQA tasks is being performed on blip2-flan-t5-xxl and Guanaco using only linear layers.
The latest weight file, based on the MiniGPT-4 implementation, is provided here.
This model supports English, Chinese, Japanese, and German, and must be used together with the Guanaco 7B LLM.
A portion of the dataset has already been released.
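
To illustrate what aligning "using only linear layers" can look like, here is a minimal sketch in plain Python. It assumes a MiniGPT-4-style setup in which the vision side and the LLM stay frozen and a single trainable linear map projects each visual token into the LLM's input embedding space. The function names and the toy dimensions are hypothetical and are not taken from this repository. In practice the layer would typically be a `torch.nn.Linear`; plain Python is used here only to keep the example dependency-free.

```python
import random


def make_linear(in_dim, out_dim, seed=0):
    """Create weight and bias for a linear layer y = W x + b (hypothetical helper)."""
    rng = random.Random(seed)
    scale = 1.0 / in_dim ** 0.5
    weight = [
        [rng.uniform(-scale, scale) for _ in range(in_dim)]
        for _ in range(out_dim)
    ]
    bias = [0.0] * out_dim
    return weight, bias


def project_tokens(tokens, weight, bias):
    """Apply the same linear map to every visual token."""
    return [
        [sum(w * x for w, x in zip(row, tok)) + b for row, b in zip(weight, bias)]
        for tok in tokens
    ]


# Toy sizes chosen for illustration only (assumptions, not the real model sizes).
NUM_VISUAL_TOKENS = 4   # number of visual tokens produced by the vision side
VISION_DIM = 8          # width of each visual token
LLM_DIM = 16            # width of the LLM's input embeddings

# Stand-in for the frozen vision model's output: random vectors.
data_rng = random.Random(1)
visual_tokens = [
    [data_rng.gauss(0.0, 1.0) for _ in range(VISION_DIM)]
    for _ in range(NUM_VISUAL_TOKENS)
]

# The linear layer is the only trainable part in this sketch.
weight, bias = make_linear(VISION_DIM, LLM_DIM)
llm_inputs = project_tokens(visual_tokens, weight, bias)

print(len(llm_inputs), len(llm_inputs[0]))  # prints "4 16"
```

The projected tokens have the same width as the LLM's text embeddings, so they can be placed alongside the embedded text prompt before being passed to the language model. Training only this projection keeps the number of learned parameters small compared with fine-tuning either model.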