PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer
File size: 129 Bytes
version https://git-lfs.github.com/spec/v1
oid sha256:745c2fb697b565af128ae973db368f2804bbdef3de8f5f13a0b41bcb19c0e276
size 6264