Update README.md
README.md
CHANGED
@@ -14,13 +14,15 @@ For more details about the technique
 
 ## Eval metrics
 
-
-
-
-
-
-
-
+| Llama-3.1-405B-Instruct-FP8 | Arc Challenge | Winogrande | HellaSwag | TruthfulQA | MMLU | MMLU cot | GSM8K | Avg |
+|-----------|---------------|------------|-----------|------------|------|----------|-------|-----|
+| Baseline | 94.7 | 87.0 | 88.3 | 64.7 | 87.5 | 88.1 | 96.1 | **86.6** |
+| 50% SingleInputKV | 94.0 | 86.3 | 88.1 | 64.2 | 85.7 | 87.5 | 95.2 | **85.9** |
+
+| Llama-3.1-8B-Instruct | Arc Challenge | Winogrande | HellaSwag | TruthfulQA | MMLU | MMLU cot | GSM8K | Avg |
+|-----------|---------------|------------|-----------|------------|------|----------|-------|-----|
+| Baseline | 82.00 | 77.90 | 80.40 | 54.56 | 67.90 | 70.63 | 82.56 | **73.71** |
+| 50% SingleInputKV | 80.38 | 78.22 | 79.30 | 54.54 | 67.30 | 69.73 | 79.45 | **72.70** |
 
 ## How to use the models
 
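
The README does not say how the `Avg` column is computed. The sketch below assumes it is the unweighted mean of the seven benchmark columns and checks that assumption against the numbers added in this diff. The names `SCORES` and `avg_matches` are illustrative and do not come from the repository.

```python
# Scores copied from the two tables added in this diff, in column order:
# Arc Challenge, Winogrande, HellaSwag, TruthfulQA, MMLU, MMLU cot, GSM8K.
# Each entry is (benchmark scores, reported Avg, decimals used in the table).
SCORES = {
    ("Llama-3.1-405B-Instruct-FP8", "Baseline"):
        ([94.7, 87.0, 88.3, 64.7, 87.5, 88.1, 96.1], 86.6, 1),
    ("Llama-3.1-405B-Instruct-FP8", "50% SingleInputKV"):
        ([94.0, 86.3, 88.1, 64.2, 85.7, 87.5, 95.2], 85.9, 1),
    ("Llama-3.1-8B-Instruct", "Baseline"):
        ([82.00, 77.90, 80.40, 54.56, 67.90, 70.63, 82.56], 73.71, 2),
    ("Llama-3.1-8B-Instruct", "50% SingleInputKV"):
        ([80.38, 78.22, 79.30, 54.54, 67.30, 69.73, 79.45], 72.70, 2),
}


def avg_matches(scores, reported, decimals):
    """Return True if `reported` equals the unweighted mean of `scores`
    once that mean is rounded to `decimals` places (assumed definition of Avg)."""
    mean = sum(scores) / len(scores)
    # Half a unit in the last reported decimal place.
    tolerance = 0.5 * 10 ** -decimals
    return abs(mean - reported) < tolerance


if __name__ == "__main__":
    for (model, variant), (scores, reported, decimals) in SCORES.items():
        status = "ok" if avg_matches(scores, reported, decimals) else "MISMATCH"
        print(f"{model} / {variant}: reported Avg {reported} -> {status}")
```

Under that assumption all four reported averages are consistent with their rows, so the check can be reused whenever a row is added or a score is updated.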