Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
JayHyeon
/
Qwen2.5-0.5B-SFT-2e-5-2ep-MDPO_0.5_5e-7-10ep_0alp_0lam
like
0
Text Generation
Transformers
Safetensors
trl-lib/ultrafeedback_binarized
qwen2
Generated from Trainer
trl
dpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
7ce47fd
Qwen2.5-0.5B-SFT-2e-5-2ep-MDPO_0.5_5e-7-10ep_0alp_0lam
Commit History
Training in progress, step 9700
7ce47fd
verified
JayHyeon
commited on
Jan 4
Training in progress, step 9500
cfd4639
verified
JayHyeon
commited on
Jan 4
Training in progress, step 9000
2560b24
verified
JayHyeon
commited on
Jan 4
Training in progress, step 8500
72c524e
verified
JayHyeon
commited on
Jan 4
Training in progress, step 8000
63ffe62
verified
JayHyeon
commited on
Jan 4
Training in progress, step 7500
a6f63fe
verified
JayHyeon
commited on
Jan 4
Training in progress, step 7000
702403b
verified
JayHyeon
commited on
Jan 4
Training in progress, step 6500
43c83d5
verified
JayHyeon
commited on
Jan 4
Training in progress, step 6000
78ce35a
verified
JayHyeon
commited on
Jan 4
Training in progress, step 5500
f74a79e
verified
JayHyeon
commited on
Jan 4
Training in progress, step 5000
91b5f71
verified
JayHyeon
commited on
Jan 4
Training in progress, step 4500
5ebf287
verified
JayHyeon
commited on
Jan 4
Training in progress, step 4000
f7f93d4
verified
JayHyeon
commited on
Jan 4
Training in progress, step 3000
453209e
verified
JayHyeon
commited on
Jan 4
Training in progress, step 2500
96049ba
verified
JayHyeon
commited on
Jan 4
Training in progress, step 2000
e0b3bc6
verified
JayHyeon
commited on
Jan 4
Training in progress, step 1500
0d66f3c
verified
JayHyeon
commited on
Jan 4
Training in progress, step 1000
607db37
verified
JayHyeon
commited on
Jan 4
Training in progress, step 500
a30cc5a
verified
JayHyeon
commited on
Jan 4
initial commit
5aa51d0
verified
JayHyeon
commited on
Jan 4