Replete-AI/Llama-3-11.5B-V2


	---
	license: other
	license_name: llama-3
	license_link: https://llama.meta.com/llama3/license/
	---
	Llama-3-11.5B-v2

	Thank you to Meta for the Meta-Llama-3-8B weights.

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png)

	This is an upscaled version of Meta-Llama-3-8B, built with the techniques created for chargoddard/mistral-11b-slimorca. The model has been upscaled from 8B parameters to 11.5B parameters without any continued pretraining or fine-tuning.
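As a rough check on the 8B to 11.5B figure, the parameter arithmetic can be sketched as below. The hyperparameters are those of the published Meta-Llama-3-8B configuration. The slice layout `SLICES` is an assumption: it mirrors the two overlapping layer ranges commonly described for mistral-11b-slimorca, and this card does not state the exact ranges used here.

```python
# Llama-3-8B hyperparameters (from the published model configuration).
HIDDEN = 4096          # hidden size
INTERMEDIATE = 14336   # MLP intermediate size
N_HEADS = 32           # attention heads
N_KV_HEADS = 8         # key/value heads (grouped-query attention)
VOCAB = 128256         # vocabulary size
BASE_LAYERS = 32       # decoder layers in the base model


def params_per_layer() -> int:
    """Parameters in one decoder layer (no biases)."""
    head_dim = HIDDEN // N_HEADS
    kv_dim = head_dim * N_KV_HEADS
    attn = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * kv_dim  # q, o + k, v projections
    mlp = 3 * HIDDEN * INTERMEDIATE                   # gate, up, down projections
    norms = 2 * HIDDEN                                # two RMSNorm weight vectors
    return attn + mlp + norms


def total_params(n_layers: int) -> int:
    """Total parameters for a model with n_layers decoder layers."""
    embeddings = 2 * VOCAB * HIDDEN  # untied input embedding and output head
    final_norm = HIDDEN
    return n_layers * params_per_layer() + embeddings + final_norm


# ASSUMPTION: two overlapping slices of the base model stacked end to end.
# These ranges are illustrative and are not confirmed by this model card.
SLICES = [(0, 24), (8, 32)]
stacked_layers = sum(end - start for start, end in SLICES)

print(f"base:    {total_params(BASE_LAYERS) / 1e9:.2f}B parameters")
print(f"stacked: {total_params(stacked_layers) / 1e9:.2f}B parameters "
      f"({stacked_layers} layers)")
```

Under that assumption the stacked model has 48 layers, which lands close to the 11.5B in the model name. Only the decoder layers are duplicated, so the embedding and output matrices are counted once.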

	Unlike version 1, this model has no issues at fp16 or in any quantization.

	The model that was used to create this one is linked below:

	https://huggingface.co/meta-llama/Meta-Llama-3-8B


	- Llama-3-11.5B-V2

	| Metric                            | Value |
	|-----------------------------------|------:|
	| Avg.                              | 66.89 |
	| AI2 Reasoning Challenge (25-shot) | 57.68 |
	| HellaSwag (10-shot)               | 78.59 |
	| MMLU (5-shot)                     | 65.39 |
	| TruthfulQA (0-shot)               | 35.86 |
	| Winogrande (5-shot)               | 74.74 |
	| GSM8k (5-shot)                    | 69.37 |

	- Original Meta-Llama-3-8B

	| Metric                            | Value |
	|-----------------------------------|------:|
	| Avg.                              | 62.87 |
	| AI2 Reasoning Challenge (25-shot) | 59.47 |
	| HellaSwag (10-shot)               | 82.09 |
	| MMLU (5-shot)                     | 66.69 |
	| TruthfulQA (0-shot)               | 43.90 |
	| Winogrande (5-shot)               | 77.35 |
	| GSM8k (5-shot)                    | 45.34 |
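To make the two tables easier to compare, here is a minimal sketch that computes the per-benchmark difference (upscaled minus original) and the plain arithmetic mean of the six listed scores. The dictionary keys are shortened labels chosen for this example; the numbers are copied from the tables above.

```python
from statistics import mean

# Scores copied from the two tables above.
upscaled = {
    "ARC": 57.68,
    "HellaSwag": 78.59,
    "MMLU": 65.39,
    "TruthfulQA": 35.86,
    "Winogrande": 74.74,
    "GSM8k": 69.37,
}
base = {
    "ARC": 59.47,
    "HellaSwag": 82.09,
    "MMLU": 66.69,
    "TruthfulQA": 43.90,
    "Winogrande": 77.35,
    "GSM8k": 45.34,
}

# Per-benchmark change: positive means the upscaled model scored higher.
deltas = {name: upscaled[name] - base[name] for name in upscaled}

# Plain arithmetic mean of the six listed benchmark scores.
upscaled_mean = mean(upscaled.values())
base_mean = mean(base.values())

for name, delta in deltas.items():
    print(f"{name:<11} {delta:+.2f}")
print(f"mean (upscaled): {upscaled_mean:.2f}")
print(f"mean (base):     {base_mean:.2f}")
```

GSM8k is the only benchmark where the upscaled model scores higher (by about 24 points); the other five drop by between roughly 1 and 8 points. The recomputed means come out near 63.6 and 62.5, which do not match the Avg. rows as listed (66.89 and 62.87); the card does not say how those averages were derived.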