Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
chchen
/
Gemma-2B-It-ORPO-SALT
like
0
PEFT
Safetensors
llama-factory
lora
trl
dpo
Generated from Trainer
License:
gemma
Model card
Files
Files and versions
Community
Use this model
6f18883
Gemma-2B-It-ORPO-SALT
Commit History
Training in progress, step 1000
6f18883
verified
chchen
commited on
Aug 15
Training in progress, step 500
fc23b1c
verified
chchen
commited on
Aug 15
initial commit
8801f58
verified
chchen
commited on
Aug 15