Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning Paper • 2410.06373 • Published 9 days ago • 33
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published 9 days ago • 101
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification Paper • 2410.05057 • Published 10 days ago • 7
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 28 days ago • 131
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines Paper • 2409.12959 • Published 28 days ago • 35
Rejuvenating image-GPT as Strong Visual Representation Learners Paper • 2312.02147 • Published Dec 4, 2023 • 4