---
base_model:
- meta-llama/Llama-3.2-3B-Instruct
library_name: transformers
tags:
- mergekit
- merge
license: apache-2.0
language:
- en
---
# Process Reward Model - 3B
This is an experimental process reward model (PRM) for math.