- brownvc/BaselineGAN-CIFAR10
  Unconditional Image Generation • Updated • 1
- brownvc/BaselineGAN-FFHQ-64x64
  Unconditional Image Generation • Updated • 1
- brownvc/BaselineGAN-FFHQ-256x256
  Unconditional Image Generation • Updated • 2
- brownvc/BaselineGAN-ImgNet-32x32
  Unconditional Image Generation • Updated • 1
Collections
Collections including paper arxiv:2501.05441
- 2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
  Paper • 2501.00958 • Published • 91
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
  Paper • 2501.01257 • Published • 45
- Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
  Paper • 2501.01423 • Published • 34
- REDUCIO! Generating 1024×1024 Video within 16 Seconds using Extremely Compressed Motion Latents
  Paper • 2411.13552 • Published
- GenEx: Generating an Explorable World
  Paper • 2412.09624 • Published • 88
- IamCreateAI/Ruyi-Mini-7B
  Image-to-Video • Updated • 17.1k • 576
- Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation
  Paper • 2412.06016 • Published • 20
- Byte Latent Transformer: Patches Scale Better Than Tokens
  Paper • 2412.09871 • Published • 85
- Depth Anything V2
  Paper • 2406.09414 • Published • 95
- An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
  Paper • 2406.09415 • Published • 50
- Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
  Paper • 2406.04338 • Published • 34
- SAM 2: Segment Anything in Images and Videos
  Paper • 2408.00714 • Published • 111