infly
/

INF-ORM-Llama3.1-70B

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

MinghaoYang commited on 22 days ago

Commit

14be646

·

verified ·

1 Parent(s): c4e1e6b

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ pipeline_tag: text-classification
 # INF Outcome Reward Model
 ## Introduction
-[**INF-ORM-Llama3.1-70B**](https://huggingface.co/Skywork/Skywork-Reward-Gemma-2-27B-v0.2) is the outcome reward model roughly built on the [Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) architecture and trained with the dataset [INF-ORM-Preference-Magnitude-80K](https://huggingface.co/datasets/infly/INF-ORM-Preference-Magnitude-80K).
 We did the following three things to improve the performance of our model.
 ### Data Pre-processing

 # INF Outcome Reward Model
 ## Introduction
+[**INF-ORM-Llama3.1-70B**] is the outcome reward model roughly built on the [Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) architecture and trained with the dataset [INF-ORM-Preference-Magnitude-80K](https://huggingface.co/datasets/infly/INF-ORM-Preference-Magnitude-80K).
 We did the following three things to improve the performance of our model.
 ### Data Pre-processing