Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
chchen
/
Mistral-7B-Instruct-v0.3-ORPO-SALT-HALF
like
0
PEFT
Safetensors
llama-factory
lora
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
Mistral-7B-Instruct-v0.3-ORPO-SALT-HALF
/
trainer_log.jsonl
Commit History
Model save
ec3c136
verified
chchen
commited on
Jul 26
Training in progress, step 1500
ef2828f
verified
chchen
commited on
Jul 26
Training in progress, step 1000
c10ec6a
verified
chchen
commited on
Jul 26
Training in progress, step 500
231f5d4
verified
chchen
commited on
Jul 26