lewtun
/

gemma-7b-dpo-full-mix2-beta-0.1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

gemma-7b-dpo-full-mix2-beta-0.1 / runs

Commit History

Training in progress, step 400

a2c5252
verified

lewtun HF staff commited on Feb 29

Training in progress, step 300

5e2ae58
verified

lewtun HF staff commited on Feb 29

Training in progress, step 200

bb98807
verified

lewtun HF staff commited on Feb 29

Training in progress, step 100

60626ac
verified

lewtun HF staff commited on Feb 29