Update README.md
README.md CHANGED
@@ -11,12 +11,12 @@ base_model: Qwen/Qwen2.5-7B
A replication attempt of Tulu 3 on the Qwen 2.5 base models.

## Evals (so far)
-|                         | Teleut 7B (measured) | Tülu 3 SFT 8B (reported) | Qwen 2.5 7B Instruct (reported) | Ministral 8B | Mistral 7B v0.3 (reported)
-
-|IFEval (prompt loose)    |66.3% |72.8% |**74.7%** |56.4%
-|BBH (3 shot, CoT)        |64.4% |**67.9%** |21.7% |56.2%
-|MMLU Pro (0 shot, CoT)   |xx.x% |xx.x% |56.3%<sup>Unknown</sup> |xx.x%
-|AlpacaEval 2 (LC winrate)|xx.x% |12.4% |29.0% |31.4%
+|                         | Teleut 7B (measured) | Tülu 3 SFT 8B (reported) | Qwen 2.5 7B Instruct (reported) | Ministral 8B (reported) | Mistral 7B v0.3 (reported)
+|-------------------------|----------------------|--------------------------|---------------------------------|-------------------------|---------------------------
+|IFEval (prompt loose)    |66.3% |72.8% |**74.7%** |56.4% |53.0%
+|BBH (3 shot, CoT)        |64.4% |**67.9%** |21.7% |56.2% |47.0%<sup>NLL</sup>
+|MMLU Pro (0 shot, CoT)   |xx.x% |xx.x% |56.3%<sup>Unknown</sup> |xx.x% |30.7%<sup>5-shot</sup>
+|AlpacaEval 2 (LC winrate)|xx.x% |12.4% |29.0% |31.4% |xx.x%

## Credits
Big thanks to Retis Labs for providing the 8xH100 polycule used to train and test this model!