ELYZA-japanese-Llama-2-MoE-2x7B-v0.1 / mergekit_moe_config.yml
Aratako
model upload
9b4b652
base_model: ./ELYZA-japanese-Llama-2-7b-instruct
gate_mode: random
dtype: bfloat16
experts:
  - source_model: ./ELYZA-japanese-Llama-2-7b-instruct
    positive_prompts: []
  - source_model: ./ELYZA-japanese-Llama-2-7b
    positive_prompts: []
tokenizer_source: model:./ELYZA-japanese-Llama-2-7b-instruct