mistralit2_250_STEPS_1e7_rate_0.1_beta_DPO / model-00003-of-00003.safetensors

Commit History

End of training
81b175c
verified

tsavage68 commited on