Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
nomadrp
/
dpo_model
like
0
PEFT
Safetensors
trl
dpo
Generated from Trainer
License:
llama3.1
Model card
Files
Files and versions
Community
Use this model
50db9bd
dpo_model
/
adapter_config.json
Commit History
Training in progress, step 500
f460670
verified
nomadrp
commited on
Aug 26
Training in progress, step 500
e6990ce
verified
nomadrp
commited on
Aug 25
Training in progress, step 500
5f6e5b3
verified
nomadrp
commited on
Aug 22