Nitral-AI committed
Commit 422400a · verified · 1 Parent(s): d536ef8

Update README.md

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -27,6 +27,10 @@ language:
 
  ## Training Notes: This model was developed using a combination of multi-stage supervised fine-tuning, pre-trained QLoRA adapters, and multi-stage RLHF optimized with GRPO. The final model was created by merging the most promising candidates identified during the process.
 
+ ## Series Comparison:
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/642265bc01c62c1e4102dc36/aXNoJZ0oc-fU4xyZyBTmk.png)
+
  # The following YAML configuration was used to produce this final version of the model:
  ```yaml
  slices: