Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
hkust-nlp
's Collections
SimpleRL
PreSelect
CodeI/O
M-STAR
Deita
🎯DART-Math
SimpleRL
updated
10 days ago
The collection for the Project "Simple Reinforcement Learning for Reasoning"
Upvote
4
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero
Updated
5 days ago
•
187
•
2
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL
Updated
5 days ago
•
188
•
1
Upvote
4
Share collection
View history
Collection guide
Browse collections