MinghaoYang commited on
Commit
14be646
·
verified ·
1 Parent(s): c4e1e6b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ pipeline_tag: text-classification
17
  # INF Outcome Reward Model
18
  ## Introduction
19
 
20
- [**INF-ORM-Llama3.1-70B**](https://huggingface.co/Skywork/Skywork-Reward-Gemma-2-27B-v0.2) is the outcome reward model roughly built on the [Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) architecture and trained with the dataset [INF-ORM-Preference-Magnitude-80K](https://huggingface.co/datasets/infly/INF-ORM-Preference-Magnitude-80K).
21
 
22
  We did the following three things to improve the performance of our model.
23
  ### Data Pre-processing
 
17
  # INF Outcome Reward Model
18
  ## Introduction
19
 
20
+ [**INF-ORM-Llama3.1-70B**] is the outcome reward model roughly built on the [Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) architecture and trained with the dataset [INF-ORM-Preference-Magnitude-80K](https://huggingface.co/datasets/infly/INF-ORM-Preference-Magnitude-80K).
21
 
22
  We did the following three things to improve the performance of our model.
23
  ### Data Pre-processing