Jiang Jiwen

jjw0126

AI & ML interests

RL, LLM

Recent Activity

liked a dataset 1 day ago
adyen/DABstep
liked a dataset 1 day ago
ibm-granite/GneissWeb

Organizations

ucas, ELM Team, PLM-Team

jjw0126's activity

upvoted 2 articles 10 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

782

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
45