sequelbox commited on
Commit
b66f8f8
1 Parent(s): 0b48e7c

eval format

Browse files
Files changed (1) hide show
  1. README.md +0 -12
README.md CHANGED
@@ -188,16 +188,4 @@ We care about open source.
188
  For everyone to use.
189
 
190
  We encourage others to finetune further from our models.
191
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
192
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ValiantLabs__Llama3.2-3B-Enigma)
193
-
194
- | Metric |Value|
195
- |-------------------|----:|
196
- |Avg. |15.77|
197
- |IFEval (0-Shot) |47.75|
198
- |BBH (3-Shot) |18.81|
199
- |MATH Lvl 5 (4-Shot)| 6.65|
200
- |GPQA (0-shot) | 1.45|
201
- |MuSR (0-shot) | 4.54|
202
- |MMLU-PRO (5-shot) |15.41|
203
 
 
188
  For everyone to use.
189
 
190
  We encourage others to finetune further from our models.
 
 
 
 
 
 
 
 
 
 
 
 
191