45 23 41

Zhen Li

Paper99

https://paper99.github.io/

AI & ML interests

image/video restoration and enhancement, generation, and editing

Recent Activity

liked a model 5 days ago

yuxi-liu-wired/CSD

upvoted a paper 8 days ago

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

liked a Space 9 days ago

Alpha-VLLM/Lumina-Image-2.0

View all activity

Organizations

Paper99's activity

upvoted a paper 8 days ago

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Paper • 2502.06782 • Published 8 days ago • 12

upvoted a paper 10 days ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published 13 days ago • 40

upvoted a paper 20 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 21 days ago • 106

upvoted 2 papers 26 days ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 27 days ago • 63

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published 26 days ago • 15

upvoted 2 papers 2 months ago

Wonderland: Navigating 3D Scenes from a Single Image

Paper • 2412.12091 • Published Dec 16, 2024 • 16

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

upvoted a paper 3 months ago

Pathways on the Image Manifold: Image Editing via Video Generation

Paper • 2411.16819 • Published Nov 25, 2024 • 33

upvoted 4 papers 4 months ago

VidPanos: Generative Panoramic Videos from Casual Panning Videos

Paper • 2410.13832 • Published Oct 17, 2024 • 13

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 34

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 93

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 171

upvoted a paper 5 months ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 94

upvoted a collection 5 months ago

Emu3

Collection

Emu3: Next-Token Prediction is All You Need • 7 items • Updated 6 days ago • 68

upvoted a paper 6 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 159

upvoted a paper 9 months ago

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 102

upvoted 2 papers 12 months ago

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 61

Sora Generates Videos with Stunning Geometrical Consistency

Paper • 2402.17403 • Published Feb 27, 2024 • 17

upvoted a collection about 1 year ago

PhotoMaker

Collection

Let us create photos/paintings/avatars for anyone in any style within seconds. • 5 items • Updated Jul 22, 2024 • 26

upvoted a paper about 1 year ago

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Paper • 2312.04461 • Published Dec 7, 2023 • 62