Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ENERGY-DRINK-LOVE
/
komt_DPOv3
like
0
Follow
ENERGY-DRINK-LOVE
9
Text Generation
Transformers
Safetensors
llama
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
arxiv:
2305.18290
License:
cc-by-4.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
komt_DPOv3
Commit History
Update README.md
b5d9eca
verified
jingyeom
commited on
Mar 16
Update README.md
c9c22f4
verified
jingyeom
commited on
Mar 16
Update README.md
4577d50
verified
jingyeom
commited on
Mar 16
End of training
5ae1a12
verified
genne
commited on
Mar 11
initial commit
a4038c8
verified
genne
commited on
Mar 11