Tess, short for Tesoro (Treasure in Italian), is a general purpose Large Language Model series. Tess-72B-v1.5b was trained on the Qwen-72B base.

Prompt Format:

SYSTEM: <ANY SYSTEM CONTEXT>
USER: 
ASSISTANT:

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	77.30
AI2 Reasoning Challenge (25-Shot)	71.25
HellaSwag (10-Shot)	85.53
MMLU (5-Shot)	76.63
TruthfulQA (0-shot)	71.99
Winogrande (5-shot)	81.45
GSM8k (5-shot)	76.95

Downloads last month: 45

Safetensors

Model size

72.3B params

Tensor type

FP16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for migtissera/Tess-72B-v1.5b

Merges

1 model

Quantizations

2 models

Evaluation results

normalized accuracy on AI2 Reasoning Challenge (25-Shot)
test set Open LLM Leaderboard

71.250
normalized accuracy on HellaSwag (10-Shot)
validation set Open LLM Leaderboard

85.530
accuracy on MMLU (5-Shot)
test set Open LLM Leaderboard

76.630
mc2 on TruthfulQA (0-shot)
validation set Open LLM Leaderboard

71.990
accuracy on Winogrande (5-shot)
validation set Open LLM Leaderboard

81.450
accuracy on GSM8k (5-shot)
test set Open LLM Leaderboard

76.950

View on Papers With Code