Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
chchen
/
Mistral-7B-Instruct-v0.3-ORPO-SALT-HALF
like
0
PEFT
Safetensors
llama-factory
lora
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
ef2828f
Mistral-7B-Instruct-v0.3-ORPO-SALT-HALF
Commit History
Training in progress, step 1500
ef2828f
verified
chchen
commited on
Jul 26
Training in progress, step 1000
c10ec6a
verified
chchen
commited on
Jul 26
Training in progress, step 500
231f5d4
verified
chchen
commited on
Jul 26
initial commit
f8e99c5
verified
chchen
commited on
Jul 26