Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
philschmid
/
qwen-2.5-3b-r1-countdown
like
6
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
r1
rl
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Train
Deploy
Use this model
main
qwen-2.5-3b-r1-countdown
/
runs
Commit History
Training in progress, step 450
0288df2
verified
philschmid
commited on
12 days ago
Training in progress, step 425
3b18948
verified
philschmid
commited on
12 days ago
Training in progress, step 400
1039b38
verified
philschmid
commited on
12 days ago
Training in progress, step 375
1e22d39
verified
philschmid
commited on
12 days ago
Training in progress, step 350
eaf0bde
verified
philschmid
commited on
12 days ago
Training in progress, step 325
5872373
verified
philschmid
commited on
12 days ago
Training in progress, step 300
4bab7d2
verified
philschmid
commited on
12 days ago
Training in progress, step 250
ac27ceb
verified
philschmid
commited on
12 days ago
Training in progress, step 200
7bc26dd
verified
philschmid
commited on
12 days ago
Training in progress, step 175
04b9f34
verified
philschmid
commited on
12 days ago
Training in progress, step 150
02b892a
verified
philschmid
commited on
12 days ago
Training in progress, step 125
ae5d28b
verified
philschmid
commited on
12 days ago
Training in progress, step 100
2d5757d
verified
philschmid
commited on
12 days ago
Training in progress, step 75
9ab4dbf
verified
philschmid
commited on
12 days ago
Training in progress, step 50
7e5167b
verified
philschmid
commited on
12 days ago
Training in progress, step 25
b821786
verified
philschmid
commited on
12 days ago