Snowflake
/

Llama-3.1-SwiftKV-8B-Instruct

Model card Files Files and versions Community

jeffra commited on Dec 5, 2024

Commit

8b82e1e

·

verified ·

1 Parent(s): 31ab9b3

Update README.md

Files changed (1) hide show

README.md +9 -7

README.md CHANGED Viewed

@@ -14,13 +14,15 @@ For more details about the technique
 ## Eval metrics
-| Model | Arc-Challenge | MMLU | MMLU-CoT | GSM-8k-CoT |
-|----------|--------------|--------------|--------------|--------------|
-| [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) | | | | |
-| [Snowflake/Llama-3.1-SwiftKV-8B-Instruct](https://huggingface.co/Snowflake/Llama-3.1-SwiftKV-8B-Instruct) | | | | |
-| [meta-llama/Llama-3.1-405B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct) | | | | |
-| [Snowflake/Llama-3.1-SwiftKV-405B-Instruct-FP8](https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct-FP8) | | | | |
 ## How to use the models

 ## Eval metrics
+| Llama-3.1-405B-Instruct-FP8 | Arc Challenge | Winogrande | HellaSwag | TruthfulQA | MMLU | MMLU cot | GSM8K | Avg |
+|-----------|---------------|------------|-----------|------------|------|----------|-------|-----|
+| Baseline | 94.7 | 87.0 | 88.3 | 64.7 | 87.5 | 88.1 | 96.1 | **86.6** |
+| 50% SingleInputKV | 94.0 | 86.3 | 88.1 | 64.2 | 85.7 | 87.5 | 95.2 | **85.9** |
+| Llama-3.1-8B-Instruct | Arc Challenge | Winogrande | HellaSwag | TruthfulQA | MMLU | MMLU cot | GSM8K | Avg |
+|-----------|---------------|------------|-----------|------------|------|----------|-------|-----|
+| Baseline | 82.00 | 77.90 | 80.40 | 54.56 | 67.90 | 70.63 | 82.56 | **73.71** |
+| 50% SingleInputKV | 80.38 | 78.22 | 79.30 | 54.54 | 67.30 | 69.73 | 79.45 | **72.70** |
 ## How to use the models