# merge

This is a merge of pre-trained language models created using mergekit.
## Merge Details

### Merge Method
This model was merged using the SLERP merge method, with chargoddard/qwen2p5-14b-llamatok as the base model.
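For intuition, SLERP (spherical linear interpolation) blends two weight tensors along the arc between them rather than along a straight line, so the interpolated weights keep a magnitude close to the originals. The sketch below is purely illustrative and is not mergekit's implementation; the function name, the flattening of tensors, and the fallback to plain linear interpolation for near-parallel tensors are assumptions made for the example.

```python
import torch

def slerp(t: float, a: torch.Tensor, b: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Illustrative spherical linear interpolation between two weight tensors."""
    a_flat, b_flat = a.flatten().float(), b.flatten().float()
    # Angle between the two tensors, measured on the unit sphere.
    a_unit = a_flat / (a_flat.norm() + eps)
    b_unit = b_flat / (b_flat.norm() + eps)
    dot = torch.clamp(a_unit @ b_unit, -1.0, 1.0)
    omega = torch.arccos(dot)
    if omega.abs() < 1e-4:
        # Nearly parallel tensors: fall back to ordinary linear interpolation.
        merged = (1.0 - t) * a_flat + t * b_flat
    else:
        merged = (torch.sin((1.0 - t) * omega) * a_flat + torch.sin(t * omega) * b_flat) / torch.sin(omega)
    return merged.reshape(a.shape).to(a.dtype)

# t = 0 keeps the base model's tensor, t = 1 takes the other model's tensor.
merged_weight = slerp(0.5, torch.randn(4096, 4096), torch.randn(4096, 4096))
```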
### Models Merged

The following models were included in the merge:

* chargoddard/qwen2p5-14b-llamatok
* large-traversaal/Qwen-2.5-14B-Hindi
### Configuration

The following YAML configuration was used to produce this model:
```yaml
slices:
  - sources:
      - model: "chargoddard/qwen2p5-14b-llamatok"
        layer_range: [0, 48]
      - model: "large-traversaal/Qwen-2.5-14B-Hindi"
        layer_range: [0, 48]
merge_method: slerp
base_model: "chargoddard/qwen2p5-14b-llamatok"
parameters:
  t:
    - filter: lm_head
      value: [0.55]
    - filter: embed_tokens
      value: [0.7]
    - filter: self_attn
      value: [0.65, 0.35]
    - filter: mlp
      value: [0.35, 0.65]
    - filter: layernorm
      value: [0.4, 0.6]
    - filter: modelnorm
      value: [0.6]
    - value: 0.5
dtype: bfloat16
```
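The merge can be reproduced by saving the YAML above to a file and running mergekit's `mergekit-yaml` entry point, then loading the resulting checkpoint like any other Hugging Face causal LM. The snippet below is a sketch of that workflow: the config filename, output directory, and Hindi prompt are assumptions for illustration, not part of this card.

```python
import subprocess
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Reproduce the merge: mergekit-yaml <config.yaml> <output-dir> (paths are hypothetical).
subprocess.run(["mergekit-yaml", "slerp-config.yaml", "./merged-qwen2p5-14b"], check=True)

# Load the merged checkpoint in the bfloat16 dtype used by the merge.
tokenizer = AutoTokenizer.from_pretrained("./merged-qwen2p5-14b")
model = AutoModelForCausalLM.from_pretrained(
    "./merged-qwen2p5-14b", torch_dtype=torch.bfloat16, device_map="auto"
)

# Quick sanity check with a Hindi prompt ("Hello, how are you?").
prompt = "नमस्ते, आप कैसे हैं?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```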