Haofan Wang

wanghaofan

AI & ML interests

Co-Founder&Researcher@InstantX

Recent Activity

Organizations

Stable Diffusion Dreambooth Concepts Library, ZeroGPU Explorers, InstantX, Social Post Explorers, Shakker Labs

wanghaofan's activity

reacted to AdinaY's post with 🔥 4 days ago
Post
2399
Two AI startups, DeepSeek & Moonshot AI, keep moving in perfect sync 👇

✨ Last December: DeepSeek & Moonshot AI released their reasoning models on the SAME DAY.
DeepSeek: deepseek-ai/DeepSeek-R1
Moonshot: https://github.com/MoonshotAI/Kimi-k1.5

✨ Last week: Both teams published papers on modifying attention mechanisms on the SAME DAY AGAIN.
DeepSeek: Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention (2502.11089)
Moonshot: MoBA: Mixture of Block Attention for Long-Context LLMs (2502.13189)

✨ TODAY:
DeepSeek unveiled FlashMLA: an efficient MLA decoding kernel for NVIDIA Hopper GPUs, optimized for variable-length sequences.
https://github.com/deepseek-ai/FlashMLA

Moonshot AI introduced Moonlight: a 3B/16B MoE trained on 5.7T tokens using Muon, pushing the Pareto frontier with fewer FLOPs.
moonshotai/Moonlight-16B-A3B

What's next? 👀
New activity in CSU-JPG/TextAtlas5M 9 days ago

Update README.md

#3 opened 9 days ago by
wanghaofan
updated a Space 13 days ago
upvoted an article about 2 months ago
Article

Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation

By RapidataAI and 5 others • 13