FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model Paper • 2410.13925 • Published 20 days ago • 21
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities Paper • 2410.14672 • Published 19 days ago • 7
Scalable Ranked Preference Optimization for Text-to-Image Generation Paper • 2410.18013 • Published 14 days ago • 14
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Paper • 2410.18666 • Published 13 days ago • 17
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper • 2410.22366 • Published 9 days ago • 71