nomadrp/dpo_model
PEFT
Safetensors
trl
dpo
Generated from Trainer
License: llama3.1
Commit History
nomadrp/llama3.1-dpo-6-langs
b7eb4d6
verified
nomadrp
committed on
Aug 25
Training in progress, step 600
69f0990
verified
nomadrp
committed on
Aug 25
Training in progress, step 500
e6990ce
verified
nomadrp
committed on
Aug 25
nomadrp/llama3.1-dpo
d112060
verified
nomadrp
committed on
Aug 22
Training in progress, step 6100
1abb5a7
verified
nomadrp
committed on
Aug 22
Training in progress, step 6000
3a147f2
verified
nomadrp
committed on
Aug 22
Training in progress, step 5500
d7721b4
verified
nomadrp
committed on
Aug 22
Training in progress, step 5000
3187fbc
verified
nomadrp
committed on
Aug 22
Training in progress, step 4500
c9612a1
verified
nomadrp
committed on
Aug 22
Training in progress, step 4000
272fd41
verified
nomadrp
committed on
Aug 22
Training in progress, step 3500
70daf08
verified
nomadrp
committed on
Aug 22
Training in progress, step 3000
bc3c375
verified
nomadrp
committed on
Aug 22
Training in progress, step 2500
f187465
verified
nomadrp
committed on
Aug 22
Training in progress, step 2000
0a6dcaf
verified
nomadrp
committed on
Aug 22
Training in progress, step 1500
e8cbfcf
verified
nomadrp
committed on
Aug 22
Training in progress, step 1000
a870dec
verified
nomadrp
committed on
Aug 22
Training in progress, step 500
5f6e5b3
verified
nomadrp
committed on
Aug 22
initial commit
580cd14
verified
nomadrp
committed on
Aug 22