Collections
Discover the best community collections!
Collections trending this week
-
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 67 -
OpenGVLab/InternVL2-8B-MPO
Image-Text-to-Text • Updated • 3.21k • 31 -
OpenGVLab/MMPR
Preview • Updated • 230 • 40 -
OpenGVLab/InternVL2_5-1B-MPO
Image-Text-to-Text • Updated • 136 • 14
-
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper • 2412.03555 • Published • 118 -
google/paligemma2-3b-pt-224
Image-Text-to-Text • Updated • 25.7k • 112 -
google/paligemma2-3b-pt-448
Image-Text-to-Text • Updated • 19.9k • 35 -
google/paligemma2-3b-pt-896
Image-Text-to-Text • Updated • 2.94k • 21