jeffra committed on
Commit 8b82e1e · verified · 1 Parent(s): 31ab9b3

Update README.md

Files changed (1)
  1. README.md +9 -7
README.md CHANGED
@@ -14,13 +14,15 @@ For more details about the technique
 
  ## Eval metrics
 
- | Model | Arc-Challenge | MMLU | MMLU-CoT | GSM-8k-CoT |
- |----------|--------------|--------------|--------------|--------------|
- | [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) | | | | |
- | [Snowflake/Llama-3.1-SwiftKV-8B-Instruct](https://huggingface.co/Snowflake/Llama-3.1-SwiftKV-8B-Instruct) | | | | |
- | [meta-llama/Llama-3.1-405B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct) | | | | |
- | [Snowflake/Llama-3.1-SwiftKV-405B-Instruct-FP8](https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct-FP8) | | | | |
-
+ | Llama-3.1-405B-Instruct-FP8 | Arc Challenge | Winogrande | HellaSwag | TruthfulQA | MMLU | MMLU cot | GSM8K | Avg |
+ |-----------|---------------|------------|-----------|------------|------|----------|-------|-----|
+ | Baseline | 94.7 | 87.0 | 88.3 | 64.7 | 87.5 | 88.1 | 96.1 | **86.6** |
+ | 50% SingleInputKV | 94.0 | 86.3 | 88.1 | 64.2 | 85.7 | 87.5 | 95.2 | **85.9** |
+
+ | Llama-3.1-8B-Instruct | Arc Challenge | Winogrande | HellaSwag | TruthfulQA | MMLU | MMLU cot | GSM8K | Avg |
+ |-----------|---------------|------------|-----------|------------|------|----------|-------|-----|
+ | Baseline | 82.00 | 77.90 | 80.40 | 54.56 | 67.90 | 70.63 | 82.56 | **73.71** |
+ | 50% SingleInputKV | 80.38 | 78.22 | 79.30 | 54.54 | 67.30 | 69.73 | 79.45 | **72.70** |
 
  ## How to use the models
 
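Review note: the README does not say how the Avg column in the added tables is computed. It appears to be the unweighted mean of the seven benchmark scores in each row. The sketch below checks that reading against the values in this diff; the averaging rule, the dictionary layout, and the helper name `check_averages` are assumptions for illustration, not something the commit states.

```python
# Scores copied from the added tables, in column order:
# Arc Challenge, Winogrande, HellaSwag, TruthfulQA, MMLU, MMLU cot, GSM8K.
# Each entry: (scores, reported Avg, decimal places used in that table).
ROWS = {
    ("Llama-3.1-405B-Instruct-FP8", "Baseline"):
        ([94.7, 87.0, 88.3, 64.7, 87.5, 88.1, 96.1], 86.6, 1),
    ("Llama-3.1-405B-Instruct-FP8", "50% SingleInputKV"):
        ([94.0, 86.3, 88.1, 64.2, 85.7, 87.5, 95.2], 85.9, 1),
    ("Llama-3.1-8B-Instruct", "Baseline"):
        ([82.00, 77.90, 80.40, 54.56, 67.90, 70.63, 82.56], 73.71, 2),
    ("Llama-3.1-8B-Instruct", "50% SingleInputKV"):
        ([80.38, 78.22, 79.30, 54.54, 67.30, 69.73, 79.45], 72.70, 2),
}


def check_averages(rows):
    """Return {row key: True/False} for whether the rounded unweighted
    mean of the scores equals the reported Avg (assumed rule)."""
    results = {}
    for key, (scores, reported, decimals) in rows.items():
        mean = sum(scores) / len(scores)
        results[key] = round(mean, decimals) == reported
    return results


if __name__ == "__main__":
    for (model, variant), ok in check_averages(ROWS).items():
        print(f"{model} / {variant}: {'matches' if ok else 'differs'}")
```

Under that assumption all four rows reproduce the reported Avg, so the bolded values are consistent with the per-benchmark scores shown in the diff.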