hongyu

learn12138

AI & ML interests

None yet

Organizations

None yet

learn12138's activity

upvoted 20 papers 5 days ago

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

Paper • 2502.12170 • Published 24 days ago • 12

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published 24 days ago • 34

Diffusion Models without Classifier-free Guidance

Paper • 2502.12154 • Published 20 days ago • 4

EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published 24 days ago • 7

Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening

Paper • 2502.12146 • Published 20 days ago • 16

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Paper • 2502.10458 • Published 25 days ago • 30

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 21 days ago • 141

MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers

Paper • 2502.07856 • Published 26 days ago • 4

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published 23 days ago • 51

VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer

Paper • 2502.05979 • Published 28 days ago • 8

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published 25 days ago • 41

Enhance-A-Video: Better Generated Video for Free

Paper • 2502.07508 • Published 26 days ago • 21

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE

Paper • 2502.06282 • Published 27 days ago • 5

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Paper • 2502.06155 • Published 27 days ago • 9

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Paper • 2502.06527 • Published 27 days ago • 10

History-Guided Video Diffusion

Paper • 2502.06764 • Published 27 days ago • 12

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published 30 days ago • 24

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published 30 days ago • 64

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published about 1 month ago • 95

Weak-to-Strong Diffusion with Reflection

Paper • 2502.00473 • Published Feb 1 • 22