kenhktsui
/

Qwen-0.5B-GRPO-gsm8k-count-wait-cap-cross-correct

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qwen-0.5B-GRPO-gsm8k-count-wait-cap-cross-correct / vocab.json

kenhktsui's picture

kenhktsui/Qwen-0.5b-GRPO-keyword-with-reward-cap-cross-correct

bcd21f9 verified 11 days ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.