ryanmarten
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -26,7 +26,7 @@ The dataset is derived by distilling DeepSeek-R1 using the [data pipeline availa
|
|
26 |
More info about the dataset can be found on the dataset card at [OpenThoughts-114k dataset](https://huggingface.co/datasets/open-thoughts/open-thoughts-114k).
|
27 |
|
28 |
This model improves upon the [Bespoke-Stratos-7B model](https://huggingface.co/bespokelabs/Bespoke-Stratos-7B), which used 17k examples ([Bespoke-Stratos-17k dataset](https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k)).
|
29 |
-
The numbers reported in the table below are
|
30 |
|
31 |
| | AIME2024 | MATH500 | GPQA-Diamond | LCB Easy v2 | LCB Medium v2 | LCB Hard v2 | LCB All v2 |
|
32 |
| --------------------------- | -------- | ------- | ------------ | ----------- | ------------- | ----------- | ---------- |
|
|
|
26 |
More info about the dataset can be found on the dataset card at [OpenThoughts-114k dataset](https://huggingface.co/datasets/open-thoughts/open-thoughts-114k).
|
27 |
|
28 |
This model improves upon the [Bespoke-Stratos-7B model](https://huggingface.co/bespokelabs/Bespoke-Stratos-7B), which used 17k examples ([Bespoke-Stratos-17k dataset](https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k)).
|
29 |
+
The numbers reported in the table below are evaluated with our open-source tool [Evalchemy](https://github.com/mlfoundations/Evalchemy).
|
30 |
|
31 |
| | AIME2024 | MATH500 | GPQA-Diamond | LCB Easy v2 | LCB Medium v2 | LCB Hard v2 | LCB All v2 |
|
32 |
| --------------------------- | -------- | ------- | ------------ | ----------- | ------------- | ----------- | ---------- |
|