Update README.md
Browse files
README.md
CHANGED
@@ -1,13 +1,32 @@
|
|
1 |
---
|
2 |
datasets:
|
3 |
- baidu/TARA
|
|
|
|
|
|
|
|
|
4 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
# Model Description
|
6 |
|
7 |
Themis is a tool-augmented preference model to address these limitations by empowering RMs with access to external environments, including calculators and search engines.
|
8 |
-
It was introduced in
|
9 |
Themis-7b is trained with [TARA](https://huggingface.co/datasets/baidu/TARA), achieving a noteworthy overall improvement of 17.7% across eight tasks in preference ranking.
|
10 |
|
|
|
|
|
|
|
|
|
|
|
11 |
# Citation
|
12 |
```text
|
13 |
@inproceedings{tarm-2024-ernie,
|
|
|
1 |
---
|
2 |
datasets:
|
3 |
- baidu/TARA
|
4 |
+
license: mit
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
library_name: transformers
|
8 |
---
|
9 |
+
|
10 |
+
|
11 |
+
<a href="https://iclr.cc/Conferences/2024" target="_blank">
|
12 |
+
<img alt="ICLR 2024" src="https://img.shields.io/badge/Proceedings-ICLR2024-red" />
|
13 |
+
</a>
|
14 |
+
|
15 |
+
Offical checkpoint for [Tool-Augmented Reward Modeling (ICLR 2024 spotlight)](https://openreview.net/pdf?id=d94x0gWTUX).
|
16 |
+
|
17 |
+
|
18 |
+
|
19 |
# Model Description
|
20 |
|
21 |
Themis is a tool-augmented preference model to address these limitations by empowering RMs with access to external environments, including calculators and search engines.
|
22 |
+
It was introduced in the [ICLR 2024 paper](https://arxiv.org/pdf/2310.01045.pdf) and first released in this [repository](https://github.com/ernie-research/Tool-Augmented-Reward-Model).
|
23 |
Themis-7b is trained with [TARA](https://huggingface.co/datasets/baidu/TARA), achieving a noteworthy overall improvement of 17.7% across eight tasks in preference ranking.
|
24 |
|
25 |
+
## 🔥 News
|
26 |
+
* **9 February, 2024:** 🎉 We release the official codebase and model weights of [`baidu/Themis-7b`](https://huggingface.co/baidu/Themis-7b). Stay tuned!🔥
|
27 |
+
* **16 January, 2024:** 🎉 Our work has been accepted to [ICLR 2024](https://iclr.cc/Conferences/2024) **spotlight**! ✨
|
28 |
+
|
29 |
+
|
30 |
# Citation
|
31 |
```text
|
32 |
@inproceedings{tarm-2024-ernie,
|