XueyingJia
/

qwen-1.5b-HH-online-dpo-ground-truth-lead-xs-batch

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

qwen-1.5b-HH-online-dpo-ground-truth-lead-xs-batch

Commit History

Training in progress, step 600

3319608
verified

XueyingJia commited on Dec 10, 2024

Training in progress, step 500

20f563c
verified

XueyingJia commited on Dec 10, 2024

Training in progress, step 400

8e01d14
verified

XueyingJia commited on Dec 10, 2024

Training in progress, step 300

db31a73
verified

XueyingJia commited on Dec 10, 2024

Training in progress, step 200

918fec5
verified

XueyingJia commited on Dec 10, 2024

Training in progress, step 100

8576728
verified

XueyingJia commited on Dec 10, 2024

initial commit

98dd5f7
verified

XueyingJia commited on Dec 10, 2024