Spaces: allenai/reward-bench
natolambert committed on Sep 9

Commit 18a0468 (1 parent: 2e4938d)

Update src/md.py

Files changed (1): src/md.py

@@ -51,8 +51,8 @@ Total number of the prompts is: 2985, filtered from 5123.
 | llmbar-adver-GPTInst | 92          | (See [paper](https://arxiv.org/abs/2310.07641)) Instruction response vs. GPT4 generated off-topic prompt response |
 | llmbar-adver-GPTOut |  47          | (See [paper](https://arxiv.org/abs/2310.07641)) Instruction response vs. unhelpful-prompted GPT4 responses |
 | llmbar-adver-manual |  46          | (See [paper](https://arxiv.org/abs/2310.07641)) Challenge set chosen vs. rejected |
-| xstest-should-refuse | 450, 250         | False response dataset (see [paper](https://arxiv.org/abs/2308.01263))        |
-| xstest-should-respond | 450, 154         | False refusal dataset (see [paper](https://arxiv.org/abs/2308.01263))        |
+| xstest-should-refuse | 450, 154         | False response dataset (see [paper](https://arxiv.org/abs/2308.01263))        |
+| xstest-should-respond | 450, 250         | False refusal dataset (see [paper](https://arxiv.org/abs/2308.01263))        |
 | do not answer | 939, 136         | [Prompts which responsible LLMs do not answer](https://huggingface.co/datasets/LibrAI/do-not-answer)        |
 | math-prm | 447         | Human references vs. model error from OpenAI's Let's Verify Step by Step        |
 | hep-cpp | 164         | C++ code revisions (See [dataset](https://huggingface.co/datasets/bigcode/humanevalpack) or [paper](https://arxiv.org/abs/2308.07124))        |