File size: 1,052 Bytes
3e2ec37
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8d8ea50
3e2ec37
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
Wandb runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/s0qdwbg6?workspace=user-yongzx


Evaluation results:
|    Task     |Version|Filter| Metric |Value |   |Stderr|
|-------------|-------|------|--------|-----:|---|-----:|
|arc_challenge|Yaml   |none  |acc     |0.1758|±  |0.0111|
|             |       |none  |acc_norm|0.2176|±  |0.0121|
|arc_easy     |Yaml   |none  |acc     |0.3742|±  |0.0099|
|             |       |none  |acc_norm|0.3565|±  |0.0098|
|logiqa       |Yaml   |none  |acc     |0.2058|±  |0.0159|
|             |       |none  |acc_norm|0.2412|±  |0.0168|
|piqa         |Yaml   |none  |acc     |0.5958|±  |0.0114|
|             |       |none  |acc_norm|0.5941|±  |0.0115|
|sciq         |Yaml   |none  |acc     |0.5930|±  |0.0155|
|             |       |none  |acc_norm|0.5720|±  |0.0157|
|winogrande   |Yaml   |none  |acc     |0.5154|±  |0.0140|
|wsc |Yaml   |none  |acc   |0.3654|±  |0.0474|
|lambada_openai|Yaml   |none  |perplexity|730.2552|±  |46.8739|
|              |       |none  |acc       |  0.1316|±  | 0.0047|