Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
huiwonLee
/
dpo_v4_reseve_v1
like
0
PEFT
Safetensors
llama
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Use this model
c84efaf
dpo_v4_reseve_v1
Commit History
Upload LlamaForCausalLM
c84efaf
verified
huiwonLee
commited on
Apr 11
Upload config
c519304
verified
huiwonLee
commited on
Apr 11
End of training
81a8812
verified
huiwonLee
commited on
Apr 11
Training in progress, epoch 1
9f10cc0
verified
huiwonLee
commited on
Apr 11
initial commit
73f01bf
verified
huiwonLee
commited on
Apr 9