File size: 309 Bytes
97643bd c4d538e 97643bd c4d538e |
1 2 3 4 5 6 7 |
---
license: llama2
pipeline_tag: image-text-to-text
---
This repository contains the Elva-Vicuna-13B model presented in [On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning](https://huggingface.co./papers/2406.11823).
|