README.md · Replete-AI/Llama-3-11.5B-V2 at d46025354061728ce70ba3ae286b03dbb493ea4a

metadata

license: other
license_name: llama-3
license_link: https://llama.meta.com/llama3/license/

Llama-3-11.5B-v2

Thank you to Meta for the weights for Meta-Llama-3-8B

This is an upscaling of the Meta-Llama-3-8B Ai using techniques created for chargoddard/mistral-11b-slimorca. This Ai model has been upscaled from 8b parameters to 11.5b parameters without any continuous pretraining or fine-tuning.

Unlike version 1 this model has no issues at fp16 or any quantizations.

The model that was used to create this one is linked below:

https://huggingface.co./meta-llama/Meta-Llama-3-8B

Llama-3-11.5B-V2

Metric	Value
Avg.	66.89
AI2 Reasoning Challenge(25-Shot)	57.68
HellaSwag (10-Shot)	78.59
MMLU (5-Shot)	65.39
TruthfulQA (0-shot)	35.86
Winogrande (5-shot)	74.74
GSM8k (5-shot)	69.37

Original Meta-Llama-3-8B

Metric	Value
Avg.	62.87
AI2 Reasoning Challenge (25-Shot)	59.47
HellaSwag (10-Shot)	82.09
MMLU (5-Shot)	66.69
TruthfulQA (0-shot)	43.90
Winogrande (5-shot)	77.35
GSM8k (5-shot)	45.34