Satori-reasoning
/

Satori-7B-Round2

@@ -1,5 +1,18 @@
 ---
 license: apache-2.0
 datasets:
 - Satori-reasoning/Satori_FT_data
 - Satori-reasoning/Satori_RL_data
@@ -112,7 +125,7 @@ Satori-7B-Round2 achieves SOTA performance and outperforms Qwen-2.5-Math-7B-Inst
 |           | OpenMath2-Llama3.1-8B     | 90.5 | 67.8 | 28.9 | 37.5 | 6.7  | 46.3 |
 |           | NuminaMath-7B-CoT         | 78.9 | 54.6 | 15.9 | 20.0 | 10.0 | 35.9 |
 |           | Qwen-2.5-7B-Instruct      | 91.6 | 75.5 | 35.5 | 52.5 | 6.7  | 52.4 |
-|           | Qwen-2.5-Math-7B-Instruct |95.2  | 83.6 | 41.6 | 62.5 | 16.7 | 59.9 |
 |           | **Satori-7B-Round2**  | 93.9 | 83.6 | 48.5 | 72.5 | 23.3 | **64.4** |
 ### **General Domain Reasoning Benchmarks**
@@ -140,6 +153,8 @@ Please refer to our blog and research paper for more technical details of Satori
  - [Blog](https://satori-reasoning.github.io/blog/satori/)
  - [Paper](https://arxiv.org/pdf/2502.02508)
 # **Citation**
 If you find our model and data helpful, please cite our paper:
 ```

 ---
 license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
+datasets:
+- Satori-reasoning/Satori_FT_data
+- Satori-reasoning/Satori_RL_data
+base_model:
+- Qwen/Qwen2.5-Math-7B
+---
+---
+license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
 datasets:
 - Satori-reasoning/Satori_FT_data
 - Satori-reasoning/Satori_RL_data
 |           | OpenMath2-Llama3.1-8B     | 90.5 | 67.8 | 28.9 | 37.5 | 6.7  | 46.3 |
 |           | NuminaMath-7B-CoT         | 78.9 | 54.6 | 15.9 | 20.0 | 10.0 | 35.9 |
 |           | Qwen-2.5-7B-Instruct      | 91.6 | 75.5 | 35.5 | 52.5 | 6.7  | 52.4 |
+|           | Qwen-2.5-Math-7B-Instruct | 95.2 | 83.6 | 41.6 | 62.5 | 16.7 | 59.9 |
 |           | **Satori-7B-Round2**  | 93.9 | 83.6 | 48.5 | 72.5 | 23.3 | **64.4** |
 ### **General Domain Reasoning Benchmarks**
  - [Blog](https://satori-reasoning.github.io/blog/satori/)
  - [Paper](https://arxiv.org/pdf/2502.02508)
+For code, see https://github.com/Satori-reasoning/Satori
 # **Citation**
 If you find our model and data helpful, please cite our paper:
 ```