Nitral-AI
/

Captain-Eris_Violet-GRPO-v0.420

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Nitral-AI commited on 1 day ago

Commit

422400a

·

verified ·

1 Parent(s): d536ef8

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -27,6 +27,10 @@ language:
 ## Training Notes: This model was developed using a combination of multi-stage supervised fine-tuning, pre-trained QLoRA adapters, and multi-stage RLHF optimized with GRPO. The final model was created by merging the most promising candidates identified during the process.
 # The following YAML configuration was used to produce this final version of the model:
 ```yaml
 slices:

 ## Training Notes: This model was developed using a combination of multi-stage supervised fine-tuning, pre-trained QLoRA adapters, and multi-stage RLHF optimized with GRPO. The final model was created by merging the most promising candidates identified during the process.
+## Series Comparison:
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/642265bc01c62c1e4102dc36/aXNoJZ0oc-fU4xyZyBTmk.png)
 # The following YAML configuration was used to produce this final version of the model:
 ```yaml
 slices: