Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
glenn2
/
gemma-7b-lora-distilabel-intel-orca-dpo-pairs
like
0
PEFT
Safetensors
trl
dpo
Generated from Trainer
License:
mit
Model card
Files
Files and versions
Community
1
Use this model
refs/pr/1
gemma-7b-lora-distilabel-intel-orca-dpo-pairs
/
training_args.bin
Commit History
argilla/gemma-7b-lora-distilabel-intel-orca-dpo-pairs
1c6fac2
verified
glenn2
commited on
Feb 24