EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published 2 days ago • 24
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer Paper • 2502.05979 • Published 6 days ago • 4
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 3 days ago • 99
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation Paper • 2502.08690 • Published 3 days ago • 31
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper • 2502.09604 • Published 2 days ago • 23
Exploring the Potential of Encoder-free Architectures in 3D LMMs Paper • 2502.09620 • Published 2 days ago • 21
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published 2 days ago • 24
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles Paper • 2502.09082 • Published 2 days ago • 19
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published 2 days ago • 18
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper • 2502.06608 • Published 5 days ago • 22
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper • 2502.09619 • Published 2 days ago • 25
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Paper • 2502.09390 • Published 2 days ago • 8
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 3 days ago • 103
3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Paper • 2502.05761 • Published 7 days ago • 4
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data Paper • 2502.08468 • Published 3 days ago • 10
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published 4 days ago • 43
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published 4 days ago • 41