cyk1337 committed · Commit 76fc47b · verified · 1 parent: f21c558

Update README.md

Files changed (1)
  1. README.md +20 -1
README.md CHANGED
@@ -1,13 +1,32 @@
  ---
  datasets:
  - baidu/TARA
  ---
  # Model Description

  Themis is a tool-augmented preference model that addresses the limitations of conventional reward models (RMs) by giving them access to external environments, including calculators and search engines.
- It was introduced in this [paper](https://arxiv.org/pdf/2310.01045.pdf) and first released in this [repository](https://github.com/ernie-research/Tool-Augmented-Reward-Model).
  Themis-7b is trained with [TARA](https://huggingface.co/datasets/baidu/TARA), achieving a noteworthy overall improvement of 17.7% across eight tasks in preference ranking.

  # Citation
  ```text
  @inproceedings{tarm-2024-ernie,
 
  ---
  datasets:
  - baidu/TARA
+ license: mit
+ language:
+ - en
+ library_name: transformers
  ---
+
+
+ <a href="https://iclr.cc/Conferences/2024" target="_blank">
+ <img alt="ICLR 2024" src="https://img.shields.io/badge/Proceedings-ICLR2024-red" />
+ </a>
+
+ Official checkpoint for [Tool-Augmented Reward Modeling (ICLR 2024 spotlight)](https://openreview.net/pdf?id=d94x0gWTUX).
+
+
+
  # Model Description

  Themis is a tool-augmented preference model that addresses the limitations of conventional reward models (RMs) by giving them access to external environments, including calculators and search engines.
+ It was introduced in the [ICLR 2024 paper](https://arxiv.org/pdf/2310.01045.pdf) and first released in this [repository](https://github.com/ernie-research/Tool-Augmented-Reward-Model).
  Themis-7b is trained with [TARA](https://huggingface.co/datasets/baidu/TARA), achieving a noteworthy overall improvement of 17.7% across eight tasks in preference ranking.

+ ## 🔥 News
+ * **9 February, 2024:** 🎉 We release the official codebase and model weights of [`baidu/Themis-7b`](https://huggingface.co/baidu/Themis-7b). Stay tuned!🔥
+ * **16 January, 2024:** 🎉 Our work has been accepted to [ICLR 2024](https://iclr.cc/Conferences/2024) as a **spotlight**! ✨
+
+
  # Citation
  ```text
  @inproceedings{tarm-2024-ernie,