LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Paper • 2412.09856 • Published 12 days ago • 9
Motion Control for Enhanced Complex Action Video Generation Paper • 2411.08328 • Published Nov 13 • 5
SketchAgent: Language-Driven Sequential Sketch Generation Paper • 2411.17673 • Published 28 days ago • 18
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters Paper • 2411.18197 • Published 28 days ago • 14
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22 • 17
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published 27 days ago • 50
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Paper • 2411.18673 • Published 27 days ago • 8
Mimir: Improving Video Diffusion Models for Precise Text Understanding Paper • 2412.03085 • Published 21 days ago • 12
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published 19 days ago • 9
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published 23 days ago • 45
MV-Adapter: Multi-view Consistent Image Generation Made Easy Paper • 2412.03632 • Published 20 days ago • 21
PanoDreamer: 3D Panorama Synthesis from a Single Image Paper • 2412.04827 • Published 19 days ago • 10
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion Paper • 2412.04301 • Published 20 days ago • 32
PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion Paper • 2409.10141 • Published Sep 16 • 1
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Paper • 2412.03558 • Published 20 days ago • 14
Video-Guided Foley Sound Generation with Multimodal Controls Paper • 2411.17698 • Published 28 days ago • 7
Generative Omnimatte: Learning to Decompose Video into Layers Paper • 2411.16683 • Published 29 days ago • 1