|
--- |
|
license: other |
|
license_name: llama-3 |
|
license_link: https://llama.meta.com/llama3/license/ |
|
--- |
|
Llama-3-11.5B-v2 |
|
|
|
Thank you to Meta for the weights for Meta-Llama-3-8B |
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png) |
|
|
|
This is an upscaling of the Meta-Llama-3-8B Ai using techniques created for chargoddard/mistral-11b-slimorca. This Ai model has been upscaled from 8b parameters to 11.5b parameters without any continuous pretraining or fine-tuning. |
|
|
|
Unlike version 1 this model has no issues at fp16 or any quantizations. |
|
|
|
The model that was used to create this one is linked below: |
|
|
|
https://huggingface.co./meta-llama/Meta-Llama-3-8B |
|
|
|
|
|
- Llama-3-11.5B-V2 |
|
|
|
| Metric | Value | |
|
|---------------------------------|------:| |
|
| Avg. | 66.89 | |
|
| AI2 Reasoning Challenge(25-Shot)| 57.68 | |
|
| HellaSwag (10-Shot) | 78.59 | |
|
| MMLU (5-Shot) | 65.39 | |
|
| TruthfulQA (0-shot) | 35.86 | |
|
| Winogrande (5-shot) | 74.74 | |
|
| GSM8k (5-shot) | 69.37 | |
|
|
|
- Original Meta-Llama-3-8B |
|
|
|
| Metric |Value| |
|
|---------------------------------|----:| |
|
|Avg. |62.87| |
|
|AI2 Reasoning Challenge (25-Shot)|59.47| |
|
|HellaSwag (10-Shot) |82.09| |
|
|MMLU (5-Shot) |66.69| |
|
|TruthfulQA (0-shot) |43.90| |
|
|Winogrande (5-shot) |77.35| |
|
|GSM8k (5-shot) |45.34| |