This is a merge of pre-trained language models created using mergekit.
This model was merged using the Passthrough merge method, with deepseek-ai/DeepSeek-R1-Distill-Llama-8B + mpasila/Llama-3.1-Literotica-LoRA-8B as the base.
The following models were included in the merge:

* deepseek-ai/DeepSeek-R1-Distill-Llama-8B + mpasila/Llama-3.1-Literotica-LoRA-8B
The following YAML configuration was used to produce this model:
```yaml
base_model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B+mpasila/Llama-3.1-Literotica-LoRA-8B
dtype: bfloat16
merge_method: passthrough
models:
  - model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B+mpasila/Llama-3.1-Literotica-LoRA-8B
tokenizer_source: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
```
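For reference, a minimal sketch of how a merge like this can be reproduced with mergekit's Python API, assuming the configuration above is saved as `merge_config.yaml`; the config file name and the output directory `./merged-model` are placeholders, not paths from this card.

```python
# Sketch: run the passthrough merge described by the YAML config above.
# "merge_config.yaml" and "./merged-model" are placeholder paths (assumptions).
import torch
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Parse the YAML merge configuration into mergekit's config object.
with open("merge_config.yaml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

# Execute the merge and write the resulting model to the output directory.
run_merge(
    merge_config,
    "./merged-model",
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # merge on GPU if one is available
        copy_tokenizer=True,             # carry over the tokenizer_source tokenizer
    ),
)
```

The `mergekit-yaml` command-line tool can be pointed at the same configuration file to produce an equivalent result.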
Detailed results can be found here.
| Metric              | Value |
|---------------------|------:|
| Avg.                | 17.84 |
| IFEval (0-Shot)     | 18.85 |
| BBH (3-Shot)        | 19.23 |
| MATH Lvl 5 (4-Shot) | 35.20 |
| GPQA (0-shot)       |  7.05 |
| MuSR (0-shot)       |  6.72 |
| MMLU-PRO (5-shot)   | 19.97 |
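To sanity-check the merged checkpoint locally, a hedged loading example with transformers is sketched below; the path `./merged-model` is a placeholder (use the local merge output or the published repository id), and the dtype mirrors the `bfloat16` setting in the merge config.

```python
# Sketch: load the merged model (placeholder path) and run a short generation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "./merged-model"  # placeholder: local merge output or HF repo id
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.bfloat16,  # matches dtype: bfloat16 in the merge config
    device_map="auto",
)

inputs = tokenizer("The quick brown fox", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```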