Submitted by ksshumab 51 Predictive Data Selection: The Data That Predicts Is the Data That Teaches · 8 authors 2
Submitted by lzq2021 32 DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking · 9 authors 4
Submitted by nicolas-dufour 25 How far can we go with ImageNet for Text-to-Image generation? · 5 authors 2
Submitted by autumncc 18 ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents · 7 authors 2
Submitted by akhaliq 12 Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids · 5 authors 2
Submitted by kamahori 11 LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation · 4 authors 2
Submitted by hturbe 10 Tell me why: Visual foundation models as self-explainable classifiers · 4 authors 2
Submitted by kamahori 9 TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval · 14 authors 2
Submitted by Yifan-Zhong 7 DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping · 7 authors 2
Submitted by adaamko 7 LettuceDetect: A Hallucination Detection Framework for RAG Applications · 2 authors 2
Submitted by BestWishYsh 4 MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing · 6 authors 2
Submitted by akhaliq 2 HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models · 8 authors 2