xiongwilee's picture

2 1

xiongwilee

xiongwilee

·

https://wilee.me

AI & ML interests

None yet

Recent Activity

upvoted an article 23 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

upvoted an article 8 months ago

How I train a LoRA: m3lt style training overview

liked a Space over 1 year ago

ysharma/Explore_llamav2_with_TGI

View all activity

Organizations

None yet

xiongwilee's activity

upvoted an article 23 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

28 days ago

• 60

upvoted an article 8 months ago

Article

How I train a LoRA: m3lt style training overview

By

•

Jul 1, 2024

• 49