Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AmberYifan
/
zephyr-7b-sft-safeDPO3
like
0
Text Generation
Transformers
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
AmberYifan/safetyQA_DPO
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
89b6f39
zephyr-7b-sft-safeDPO3
Commit History
Training in progress, step 300
def0c3f
verified
AmberYifan
commited on
May 8, 2024
Training in progress, step 200
c2a0981
verified
AmberYifan
commited on
May 8, 2024
Training in progress, step 100
61158ce
verified
AmberYifan
commited on
May 8, 2024
initial commit
058e465
verified
AmberYifan
commited on
May 8, 2024
Previous
1
2
Next