Merge2-Llama-3.1-8B
Merge2-Llama-3.1-8B is a merge of the following models using mergekit:
- NCSOFT/Llama-VARCO-8B-Instruct
- sh2orc/Llama-3.1-Korean-8B-Instruct
🧩 Configuration
```yaml
slices:
  - sources:
      - model: NCSOFT/Llama-VARCO-8B-Instruct
        layer_range: [0, 32]
      - model: sh2orc/Llama-3.1-Korean-8B-Instruct
        layer_range: [0, 32]
merge_method: slerp
base_model: NCSOFT/Llama-VARCO-8B-Instruct
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5
dtype: bfloat16
```
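To illustrate what the `slerp` method and the `t` lists in the configuration mean, here is a minimal pure-Python sketch. It is not mergekit's implementation: the helper names `layer_t` and `slerp` are hypothetical, and the assumption that a list of `t` values is stretched across the 32 layers by linear interpolation reflects a reading of mergekit's gradient parameters rather than anything stated in this card. With `t = 0` the result equals the base model's weights (NCSOFT/Llama-VARCO-8B-Instruct) and with `t = 1` it equals the other model's.

```python
import math


def layer_t(gradient, layer_idx, num_layers):
    """Interpolate a list of anchor values linearly across layer indices.

    Assumption: the anchors are spread evenly from the first to the last layer.
    """
    if len(gradient) == 1 or num_layers == 1:
        return float(gradient[0])
    # Map the layer index onto the 0..len(gradient)-1 anchor scale.
    pos = layer_idx / (num_layers - 1) * (len(gradient) - 1)
    lo = min(int(math.floor(pos)), len(gradient) - 2)
    frac = pos - lo
    return (1 - frac) * gradient[lo] + frac * gradient[lo + 1]


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors."""
    lerp = [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    if norm0 < eps or norm1 < eps:
        return lerp  # the angle is undefined for a zero vector
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))
    if abs(dot) > 1 - 1e-6:
        return lerp  # nearly colinear vectors: fall back to linear interpolation
    theta = math.acos(dot)
    sin_theta = math.sin(theta)
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


# The two gradients from the configuration above.
SELF_ATTN_T = [0, 0.5, 0.3, 0.7, 1]
MLP_T = [1, 0.5, 0.7, 0.3, 0]

if __name__ == "__main__":
    for layer in (0, 8, 16, 24, 31):
        print(layer, layer_t(SELF_ATTN_T, layer, 32), layer_t(MLP_T, layer, 32))
```

Under these assumptions, the self-attention weights start at the base model in the first layer and end at the Korean model in the last, the MLP weights run the opposite way, and every other tensor uses a fixed `t` of 0.5.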