Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Paper • 2501.08331 • Published 9 days ago • 16
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 2 days ago • 16
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 3 days ago • 43
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments Paper • 2501.10893 • Published 5 days ago • 20
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 9 days ago • 57
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Paper • 2501.09433 • Published 8 days ago • 17
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Paper • 2501.09756 • Published 7 days ago • 18
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Paper • 2501.08225 • Published 10 days ago • 18
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published 16 days ago • 41
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 14 days ago • 82
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 16 days ago • 245
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Paper • 2204.12484 • Published Apr 26, 2022 • 2
TransPixar: Advancing Text-to-Video Generation with Transparency Paper • 2501.03006 • Published 18 days ago • 22
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper • 2501.03847 • Published 17 days ago • 23
PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published 24 days ago • 16
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Paper • 2412.21037 • Published 25 days ago • 23
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Paper • 2412.15322 • Published Dec 19, 2024 • 18