PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer

Commit History

End of training
cfca1d8
verified

khongtrunght commited on

Model save
844f260
verified

khongtrunght commited on

Training in progress, step 1091
e303bc6
verified

khongtrunght commited on

Training in progress, step 1000
1bc1667
verified

khongtrunght commited on

Training in progress, step 900
93c94a5
verified

khongtrunght commited on

Training in progress, step 800
879542f
verified

khongtrunght commited on

Training in progress, step 700
b538dc9
verified

khongtrunght commited on

Training in progress, step 600
e91d931
verified

khongtrunght commited on

Training in progress, step 500
587a4a1
verified

khongtrunght commited on

Training in progress, step 400
223bf41
verified

khongtrunght commited on

Training in progress, step 300
f823698
verified

khongtrunght commited on

Training in progress, step 200
8a1137c
verified

khongtrunght commited on

Training in progress, step 100
d6f7852
verified

khongtrunght commited on

initial commit
f31002c
verified

khongtrunght commited on