dpo-llama-3-1-8b-math / trainer_state.json

Commit History

Model save
f39d639
verified

philschmid commited on