weqweasdas
committed on
Commit 3f8102a
1 Parent(s): fc1057c
Update README.md
README.md
CHANGED
@@ -8,6 +8,7 @@

 The reward model is trained from the base model [google/gemma-7b-it](https://huggingface.co/google/gemma-7b-it).

+The training script is available at https://github.com/WeiXiongUST/RLHF-Reward-Modeling .

 ## Model Details
