llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja2000 / model-00004-of-00006.safetensors

Commit History

(Trained with Unsloth)
10e65c4
verified

u-10bei commited on