Wonderland: Navigating 3D Scenes from a Single Image Paper • 2412.12091 • Published 27 days ago • 15
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Paper • 2412.09619 • Published Dec 12, 2024 • 20
Efficient Training with Denoised Neural Weights Paper • 2407.11966 • Published Jul 16, 2024 • 8
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Paper • 2406.04333 • Published Jun 6, 2024 • 37
TextCraftor: Your Text Encoder Can be Image Quality Controller Paper • 2403.18978 • Published Mar 27, 2024 • 13
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Paper • 2402.19479 • Published Feb 29, 2024 • 32
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Paper • 2402.14797 • Published Feb 22, 2024 • 20
AToM: Amortized Text-to-Mesh using 2D Diffusion Paper • 2402.00867 • Published Feb 1, 2024 • 10
LightSpeed: Light and Fast Neural Light Fields on Mobile Devices Paper • 2310.16832 • Published Oct 25, 2023 • 4
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Paper • 2310.08579 • Published Oct 12, 2023 • 15
R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis Paper • 2203.17261 • Published Mar 31, 2022 • 1
Rethinking Vision Transformers for MobileNet Size and Speed Paper • 2212.08059 • Published Dec 15, 2022 • 4
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Paper • 2306.17843 • Published Jun 30, 2023 • 43
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Paper • 2306.00980 • Published Jun 1, 2023 • 14