Qwen2-1.5B-GRPO-demo / config.json

Commit History

Training in progress, step 10
0f6e05a
verified

longlian committed on