MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion Paper • 2502.04235 • Published 5 days ago • 16
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior Paper • 2407.07580 • Published Jul 10, 2024 • 1
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Paper • 2501.16764 • Published 14 days ago • 21
EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion Paper • 2501.13452 • Published 19 days ago • 7
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 21 days ago • 49
Diffusion Adversarial Post-Training for One-Step Video Generation Paper • 2501.08316 • Published 28 days ago • 32
Sa2VA Model Zoo Collection Hugging Face Model Zoo for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos, by ByteDance Seed CV Research • 4 items • Updated 2 days ago • 29