PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer
File size: 134 Bytes
188731d
ce4716f
version https://git-lfs.github.com/spec/v1
oid sha256:354821e69540b351b24489a28233e4aa0ee7ce0d1f56bf3615df9d19993428c7
size 323014168
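
The three lines above are a Git LFS pointer rather than the stored file itself: they record the spec version, a SHA-256 object ID, and the size in bytes of the real object. As a minimal sketch of how such a pointer could be read, assuming the simple `key value` layout shown here (the helper name `parse_lfs_pointer` is hypothetical and not part of any library):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is written as "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real object, in bytes
    }


# Pointer text copied from the file shown above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:354821e69540b351b24489a28233e4aa0ee7ce0d1f56bf3615df9d19993428c7
size 323014168
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

The pointer text itself is 134 bytes long, which matches the file size reported for this page; the `size` field refers to the object the pointer stands in for.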