Temporal Preference Optimization for Long-Form Video Understanding Paper • 2501.13919 • Published 4 days ago • 18
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step Paper • 2501.13926 • Published 4 days ago • 25
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 5 days ago • 213
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Paper • 2501.12224 • Published 6 days ago • 44
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Paper • 2501.12202 • Published 6 days ago • 26
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published 7 days ago • 76
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 13 days ago • 58
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 7 days ago • 45
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published 10 days ago • 37
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published 13 days ago • 30
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published 11 days ago • 22
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Paper • 2501.09433 • Published 11 days ago • 17
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 11 days ago • 65
PokerBench: Training Large Language Models to become Professional Poker Players Paper • 2501.08328 • Published 13 days ago • 13
Diffusion Adversarial Post-Training for One-Step Video Generation Paper • 2501.08316 • Published 13 days ago • 32
Multi-subject Open-set Personalization in Video Generation Paper • 2501.06187 • Published 17 days ago • 13
OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints Paper • 2501.03841 • Published 20 days ago • 52