llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja5000 / model-00001-of-00006.safetensors

Commit History

(Trained with Unsloth)
2a0e4f6
verified

u-10bei commited on