LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Paper • 2311.05437 • Published Nov 9, 2023 • 45
Ziya-VL: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning Paper • 2310.08166 • Published Oct 12, 2023 • 1