Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published 12 days ago • 23
Diffusion Adversarial Post-Training for One-Step Video Generation Paper • 2501.08316 • Published 26 days ago • 32
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published about 1 month ago • 87
timm/vit_medium_patch16_reg4_gap_256.sbb_in12k_ft_in1k Image Classification • Updated 19 days ago • 965 • 2
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Paper • 2412.17153 • Published Dec 22, 2024 • 34
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published Dec 23, 2024 • 24
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published Dec 12, 2024 • 20
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Paper • 2412.09619 • Published Dec 12, 2024 • 23
Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction Paper • 2411.14762 • Published Nov 22, 2024 • 11