llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000 / model-00005-of-00006.safetensors

Commit History

(Trained with Unsloth)
23a2359
verified

u-10bei commited on