metadata
base_model:
- sthenno-com/miscii-14b-1225
- SicariusSicariiStuff/Impish_QWEN_14B-1M
- arcee-ai/Virtuoso-Small
- ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
- Qwen/Qwen2.5-14B-Instruct-1M
- sthenno/tempesthenno-nuslerp-0124
- sthenno/tempesthenno-0126-ckpt150
library_name: transformers
tags:
- mergekit
- merge
merge
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the SCE merge method using sthenno-com/miscii-14b-1225 as a base.
Models Merged
The following models were included in the merge:
- SicariusSicariiStuff/Impish_QWEN_14B-1M
- arcee-ai/Virtuoso-Small
- ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
- Qwen/Qwen2.5-14B-Instruct-1M
- sthenno/tempesthenno-nuslerp-0124
- sthenno/tempesthenno-0126-ckpt150
Configuration
The following YAML configuration was used to produce this model:
merge_method: sce
models:
- model: sthenno/tempesthenno-nuslerp-0124
- model: Qwen/Qwen2.5-14B-Instruct-1M
- model: sthenno/tempesthenno-0126-ckpt150
- model: arcee-ai/Virtuoso-Small
- model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
- model: SicariusSicariiStuff/Impish_QWEN_14B-1M
- model: ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign
base_model: sthenno-com/miscii-14b-1225
parameters:
select_topk: 1.0
dtype: bfloat16
normalize: true