Model Overview
This model has been fine-tuned using the The Second Iteration of GSM8K Preference Dataset with the Direct Preference Optimization algorithm. The goal of this fine-tuning is to enhance the model's performance in solving arithmetic problems on the GSM8K benchmark.
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no pipeline_tag.