LeeSB/zephyr-7b-dpo-qlora
Tags: PEFT, TensorBoard, Safetensors, trl, dpo, Generated from Trainer
License: apache-2.0
zephyr-7b-dpo-qlora/runs/Mar10_00-42-29_mosaic-cirrascale-37.reviz.ai2.in
Commit History
72e2779  Training in progress, step 500  (verified; LeeSB committed on Mar 11)
69de067  Training in progress, step 400  (verified; LeeSB committed on Mar 10)
aa4f6fc  Training in progress, step 300  (verified; LeeSB committed on Mar 10)
5b92faa  Training in progress, step 200  (verified; LeeSB committed on Mar 10)
03e6f6c  Training in progress, step 100  (verified; LeeSB committed on Mar 10)