The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio Paper • 2410.12787 • Published 28 days ago • 30
MobA: A Two-Level Agent System for Efficient Mobile Task Automation Paper • 2410.13757 • Published 27 days ago • 31
MULTI: Multimodal Understanding Leaderboard with Text and Images Paper • 2402.03173 • Published Feb 5 • 3