richmao PRO

deter3

AI & ML interests

None yet

deter3's activity

liked a dataset 3 days ago

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated 2 days ago • 251k • 1.3k • 85

liked a dataset 11 days ago

lmms-lab/multimodal-open-r1-8k-verified

Viewer • Updated Jan 27 • 7.69k • 4.11k • 41

liked 2 models 11 days ago

lmms-lab/Qwen2-VL-2B-GRPO-8k

Updated Jan 28 • 913 • 11

apple/mobilevit-small

Image Classification • Updated 4 days ago • 660k • 57

liked 2 models 14 days ago

mkurman/Llama-3.2-MedIT-3B-R1

Updated 14 days ago • 86 • 1

mkurman/Qwen2.5-14B-DeepSeek-R1-1M

Text Generation • Updated Jan 27 • 8.29k • 50

liked a dataset 15 days ago

open-r1/OpenR1-Math-Raw

Viewer • Updated 4 days ago • 516k • 1.68k • 70

liked a model 18 days ago

tomg-group-umd/huginn-0125

Text Generation • Updated 5 days ago • 8.79k • 236

liked a dataset 19 days ago

phihung/titanic

Viewer • Updated Jun 22, 2022 • 891 • 343 • 4

liked a model 19 days ago

ValueFX9507/Tifa-Deepsex-14b-CoT

Reinforcement Learning • Updated 15 days ago • 70.6k • 183

upvoted an article 19 days ago

Article

Process Reinforcement through Implicit Rewards

and 1 other • Jan 3 • 24

liked a dataset 19 days ago

PRIME-RL/Eurus-2-RL-Data

Viewer • Updated 9 days ago • 483k • 2.13k • 26