Model Summary
s1 is a reasoning model fine-tuned from Qwen2.5-32B-Instruct on just 1,000 examples. It matches o1-preview and exhibits test-time scaling via budget forcing.
- Repository: simplescaling/s1
- Paper: TODO
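This card does not spell out what budget forcing does. The sketch below is one plausible reading, not the authors' implementation: the decoding loop caps the number of thinking tokens, and if the model tries to stop thinking before a minimum budget is reached, the stop is suppressed and a continuation cue is appended instead. The names `budget_forced_thinking`, `toy_step`, `END_OF_THINKING`, and `CONTINUE_CUE`, and the choice of "Wait" as the cue, are all assumptions made for illustration. A real setup would replace `toy_step` with a call into the model.

```python
from typing import Callable, List

# Hypothetical markers: the real delimiter and cue depend on the model's chat template.
END_OF_THINKING = "</think>"
CONTINUE_CUE = "Wait"


def budget_forced_thinking(
    step: Callable[[List[str]], str],
    prompt: List[str],
    min_tokens: int,
    max_tokens: int,
) -> List[str]:
    """Generate a thinking trace whose length is steered by a token budget.

    `step` maps the current context (a list of tokens) to the next token.
    """
    thinking: List[str] = []
    while len(thinking) < max_tokens:
        token = step(prompt + thinking)
        if token == END_OF_THINKING:
            if len(thinking) >= min_tokens:
                break  # budget satisfied, let the model stop
            # Stopped too early: drop the delimiter and nudge the model onward.
            thinking.append(CONTINUE_CUE)
            continue
        thinking.append(token)
    # Close the trace, whether the model stopped or the cap was hit.
    return thinking + [END_OF_THINKING]


def toy_step(context: List[str]) -> str:
    """Toy stand-in for a model: emits one "step" token, then tries to stop."""
    if context and context[-1] == "step":
        return END_OF_THINKING
    return "step"


if __name__ == "__main__":
    print(budget_forced_thinking(toy_step, ["Q"], min_tokens=4, max_tokens=10))
```

Raising `min_tokens` makes the toy model think longer, while lowering `max_tokens` cuts it off sooner. That is the sense in which a budget can trade compute for reasoning length at test time.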
Use
The model usage is documented here.
Citation
TODO