richmao PRO

deter3

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago
SynthLabsAI/Big-Math-RL-Verified
liked a dataset 11 days ago
lmms-lab/multimodal-open-r1-8k-verified
liked a model 11 days ago
lmms-lab/Qwen2-VL-2B-GRPO-8k
View all activity

Organizations

Chinese LLMs on Hugging Face's profile picture

deter3's activity

upvoted an article 19 days ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu and 1 other •
• 24