Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 9 days ago • 317
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Paper • 2408.05517 • Published Aug 10, 2024 • 2
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published 26 days ago • 43
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Paper • 2408.05517 • Published Aug 10, 2024 • 2