The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 2 days ago • 45
[MASK] is All You Need Collection Code, dataset, and pretrained model • 5 items • Updated Nov 29, 2024 • 8
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 29 days ago • 125
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait Paper • 2412.01064 • Published Dec 2, 2024 • 25
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published Oct 14, 2024 • 29
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Paper • 2410.02416 • Published Oct 3, 2024 • 26
Loradex Highlights Collection This collection features awesome opensource LoRAs trained by members of the Glif Community during Loradex Early Access! • 14 items • Updated Oct 18, 2024 • 19
Emu3 Collection Emu3: Next-Token Prediction is All You Need • 5 items • Updated about 7 hours ago • 67
view article Article Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face By andreagagliano • Sep 6, 2024 • 16
view article Article Enhancing Image Model Dreambooth Training Through Effective Captioning: Key Observations By alvdansen • Jun 19, 2024 • 17
view article Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models By isidentical • Aug 26, 2024 • 40
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Paper • 2408.06072 • Published Aug 12, 2024 • 37
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 118
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts Paper • 2408.03209 • Published Aug 6, 2024 • 22
Scaling Diffusion Transformers to 16 Billion Parameters Paper • 2407.11633 • Published Jul 16, 2024 • 25